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2) INFORMATION FOR SEQ ID NO: 158 

5 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Salmonella choleraesuis subsp. arizonae 

(B) STRAIN: ATCC 13314 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158 

20 CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGC CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 200 

ATACGCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

25 GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ACTCTTACAT 300 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 400 

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

30 ACGAAGGCCG TGCCGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAAT, CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACTACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTGGAGAT 750 

35 GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGGTCTG CGTTTCGCAA 830 



40 2) INFORMATION FOR SEQ ID NO: 159 



< 1 ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Choleraesuis 

(B) STRAIN: ATCC 7001 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159 

CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGYCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

60 TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 2 00 

96 
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ACACGCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGTGACGCA 250 

GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 3 00 

TCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 4 00 

5 CGCGGTATCA TCAAAGTGGG YGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGTTGG 500 

ACGAAGGCCG TGCCGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

10 GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACTACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCAATGGA CGACGGTCTG CGTTTCGCAA TC 832 



15 



30 



2) INFORMATION FOR SEQ ID NO: 160 



{ i ) SEQUENCE CHARACTERISTICS : 
2 0 (A) LENGTH: 807 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. diarizonae 

(B) STRAIN: ATCC 43973 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160 



CCTGGTTGTT GCTGCGACTG ACGGCCCGAT* GC CGGAGACC CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GfiCGTTCCGT ACATTATCGT GTTCCTGAAC 100 

35 AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TAGAAATGGA 150 

AGTTCGTGAA CTTCTGTCTC AGTACGACTT CCCGGGCGAC GACACGC CAA 2 00 

TCGTTCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGACGC AGAGTGGGAA 250 

GCGAAAATCA TCGAACTGGC TGGCTTCCTG GATTCTTACA TCCCGGAACC 3 00 

AGAGCGTGCG ATTGACAAGC CGTTCCTGCT GCCGATCGAA GACGTATTCT 350 

40 CCATCTCCGG TCGTGGTACC GTTGTTACCG GTCGTGTAGA GCGCGGTATC 4 00 

ATCAAAGTGG GCGAAGAAGT TGAAATCGTT GGTATCAAAG AGACTCAGAA 450 

GTCTACCTGT ACTGGCGTTG AAATGTTCCG CAARCTGCTG GACGAAGGCC 500 

GTGCWGGTGA GAACGTAGGT GTTCTGCTGC GTGGTATCAA ACGTGAAGAA 550 

ATCGAACGTG GTCAGGTACT GGCTAAGCCG GGCACCATCA AGCCGCACAC 600 

45 CAAGTTCGAA TCTGAAGTGT ATATTCTGTC CAAAGATGAA GGCGGCCGTC 650 

ATACTCCGTT CTTCAAAGGC TACCGTCCGC AGTTCTACTT CCGTACCACT 700 

GACGTGACTG GCACCATCGA ACTGCCGGAA GGCGTGGAGA TGGTAATGCC 750 

GGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCG ATCGCGATGG 800 

ACGACGG 807 

50 



2) INFORMATION FOR SEQ ID NO: 161 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
5 serotype Heidelberg 

(B) STRAIN: ATCC 8326 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161 

10 CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 200 

ACACTCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

15 GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 300 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 4 00 

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

20 ACGAAGGCCG TGCCGGTGAG AACGTAGGTG TTC TGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA YACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACTACTG ACGTGACTGG CACCATCGAA TTGCCGGAAG GCGTAGAGAT 750 

25 GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGGTCTG CGTTTCGCAA TC 832 



30 2) INFORMATION FOR SEQ ID NO: 162 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 807 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. houtenae 

(B) STRAIN: ATCC 43974 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162 

45 

CCTGGTTGTT GCTGCGACTG 

TCCTGCTGGG TCGTCAGGTA 

AAATGCGACA TGGTTGATGA 

AGTTCGTGAA CTTCTGTCTC 
50 TCGTGCGTGG TTCTGCTCTG 

GCGAAAATCA TCGAACTGGC 

AGAGCGTGCG ATTGACAAAC 

CTATCTCCGG TCGTGGTACC 

ATCAAAGTGG GCGAAGAAGT 
55 GTCGACCTGT ACTGGCGTTG 

GTGCTGGCGA GAACGTAGGT 

ATCGAACGTG GTCAGGTACT 

CAAGTTCGAA TCTGAAGTGT 

ATACTCCGTT CTTCAAAGGC 
60 . GACGTGACTG GCACCATCGA 
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ACGGCCCGAT GCCGCAGACC CGTGAGCACA 50 

GGCGTTCCGT ACATCATCGT GTTCCTGAAC 100 

CGAAGAGCTG CTGGAACTGG TAGAAATGGA 150 

AGTACGATTT CCCGGGCGAC GACACGCCGA 200 

AAAGCGCTGG AAGGCGACGC AGAATGGGAA 250 

TGGCTACCTG GATTCTTACA TCCCGGAACC 300 

CGTTCCTGCT GCCGATCGAA GACGTATTCT 350 

GTTGTTACCG GTCGTGTAGA GCGCGGTATC 400 

TGAAATCGTT GGTATCAAAG AGACTCAGAA 450 

AAATGTTCCG CAAACTGCTG GACGAAGGCC 500 

GTTCTGCTGC GTGGTATCAA ACGTGAAGAA 550 

GGCTAAGCCG GGCACCATCA AGCCGCACAC 600 

ACATTCTGTC CAAAGATGAA GGCGGCCGTC 650 

TACCGTCCGC AATTCTACTT CCGTACGACT 700 

ACTGCCGGAA GGCGTGGAGA TGGTAATGCC 750 
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GGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCG ATCGCGATGG 800 
ACGACGG 807 



2) INFORMATION FOR SEQ ID NO: 163 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 827 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella, choleraesuis subsp. indica 

(B) STRAIN: ATCC 4 3 976 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163 

CGCGATCCTG GTTGTTGCTG CGACTGACGG CCCGATGCCG CAGACCCGTG 50 

AGCACATCCT GCTGGGTCGT CAGGTAGGCG TTCCGTACAT CATCGTGTTC 100 

CTGAACAAAT GCGACATGGT TGATGACGAA GAGCTGCTGG AACTGGTTGA 150 

25 AATGGAAGTT CGTGAACTTC TGTCTCAGTA CGACTTCCCG GGTGACGACA 200 

CGCCGATCGT GCGTGGTTCT GCTCTGAAAG CGCTGGAAGG CGACGCAGAG 250 

TGGGAAGCGA AAATCATCGA ACTGGCTGGC TTCCTGGATT CTTACATTCC 300 

GGAACCAGAG CGTGCGATTG ACAAGCCGTT CCTGCTGCCG ATCGAAGACG 3 50 

TATTCTCCAT CTCCGGTCGT GGTACYGTTG TTACCGGTCG TGTAGAGCGC 4 00 

30 GGTATCATCA AAGTGGGCGA AGAAGTTGAA ATCGTTGGTA TCAAAGAGAC 450 
TCAGAAGTCT ACCTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGACG - 500 

AAGGCCGTGC CGGTGAGAAC GTAGGTGTTC TGCTGCGTGG TATCAAACGT 550 

GAAGAAATCG AACGTGGTCA GGTACTGGCT AAGCCGGGCA CCATCAAGCC 600 

GCACACCAAG TTCGAATCTG AAGTGTACAT TCTGTCCAAA GATGAAGGCG 650 

35 GCCGTCATAC TCCGTTCTTC AAAGGCTACC GTCCGCAGTT CTACTTCCGT 700 

ACTACTGACG TGACTGGCAC CATCGAACTG CCGGAAGGCG TAGAGATGGT 750 

AATGCCGGGC GACAACATCA AAATGGTTGT TACCCTGATC CATCCGATCG 800 

CRATGGACGA CGGTCTGCGT TTCGCAA 827 



40 



55 



2) INFORMATION FOR SEQ ID NO: 164 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 807 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. salamae 

(B) STRAIN: ATCC 43972 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164 



CCTGGTTGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACC CGTGAGCACA 50 
TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCCTGAAC 100 
60 AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TAGAAATGGA 150 

99 
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AGTTCGTGAA CTTCTGTCTC AGTACGACTT 

TCGTGCGTGG TTCCGCTCTG AAAGCGCTGG 

GMGAAAATCA TCGAACTGGC TGGCTWCCTG 

AGAGCGTGCG ATTGACAAGC CGTTCCTGCT 

5 CCATCTCCGG TCGTGGTACC GTTGTTACCG 

ATCAAAGTGG GCGAAGAAGT TGAAATCGTT 

GTCTACCTGT ACTGGCGTTG AAATGTTCCG 

GTGCCGGTGA GAACGTAGGT GTTCTGCTGC 

ATCGAACGTG GTCAGGTACT GGCTAAGCCG 

10 CAAGTTCGAA TCTGAAGTGT ACATTCTGTC 

ATACTCCGTT CTTCAAAGGC TACCGTCCGC 
GACGTGACTG GCAC CATCGA ACTGCCGGAA 

GGGCGACAAC ATCAAAATGG TTGTTACCCT 
ACGACGG 

15 



2) INFORMATION FOR SEQ ID NO: 165 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella 
30 serotype Typhi 

(B) STRAIN: ATCC 10749 : 

(xi) SEQUENCE DESCRIPTION: SEQ'< 

35 CGGCGCGATC CTGGTTGTTG CTGCGACTGA 
GTGAGCACAT CCTGCTGGGT CGTCAGGTAG 
TTCCTGAACA AATGCGACAT GGTTGATGAC 
TGAAATGGAA GTTCGTGAAC TTCTGTCTCA 
ACACGCCGAT CGTTCGTGGT TCTGCTCTGA 

40 GAGTGGGAAG CGAAAATCAT CGAACTGGCT 
CCCGGAACCA GAGCGTGCGA TTGACAAGCC 
ACGTATTCTC CATCTCCGGT CGTGGTACCG 
CGCGGTATCA TCAAAGTGGG CGAAGAAGTT 
GACTCAGAAG TCTACCTGTA CTGGCGTTGA 

45 ACGAAGGCCG TGCNGGTGAG AACGTAGGTG 
CGTGAAGAAA TCGAACGTGG TCAGGTACTG 
GCCGCACACY AAGTTCGAAT CTGAAGTGTA 
GCGGCCGTCA TACTCCGTTC TTCAAAGGCT 
CGTACTACTG ACGTGACTGG CACCATCGAA 

50 GGTAATGCCG GGCGACAACA TCAAAATGGT 
TCGCAATGGA CGACGGTCTG CGTTTCGCAA 



55 2) INFORMATION FOR SEQ ID NO: 166 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 

1 





pjvpA Henoch 


















CI a CH T a TTPT 






PPPPPPTSTP 




GGTATCAAAG 


AGAC TCAGAA 


450 


CAAACTGCTG 


GACGAAGGCC 


500 


GTGGTATCAA 


ACGTGAAGAA 


550 


GGCACCATCA 


AGCCGCACAC 


600 


CAAAGATGAA 


GGCGGCCGTC 


650 


AGTTCTACTT 


CCGTACCACT 


700 


GGCGTGGAGA 


TGGTAATGCC 


750 


GATCCACCCG 


ATCGCGATGG 


800 
807 



choleraesuis subsp. choleraesuis 



NO: 165 



CGGCCCGATG 


CCGCAGACCC 


50 


GCGTTCCGTA 


CATCATCGTG 


100 


GAAGAGCTGC 


TGGAACTGGT 


150 


GTACGACTTC 


CCGGGCGACG 


200 


AAGCGCTGGA 


AGGCGACGCA 


250 


GGCTTCCTGG 


ATTCTTACAT 


300 


GTTCCTGCTG 


CCGATCGAAG 


350 


TTGTTACCGG 


TCGTGTAGAA 


400 


GAAATCGTTG 


GTATCAAAGA 


450 


AATGTTCCGC 


AAACTGCTGG 


500 


TTCTGCTGCG 


TGGTATCAAA 


550 


GCTAAGCCGG 


GCAC CATCAA 


600 


CATTCTGTCC 


AAAGATGAAG 


650 


ACCGTCCGCA 


GTTCTACTTC 


700 


CTGCCGGAAG 


GCGTAGAGAT 


750 


TGTTACCCTG 


ATCCACCCGA 


800 


TC 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia fonticola 

(B) STRAIN: DSM 4576 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166 



10 



CGGCGCTATC CTGGTTGTAG CTGCGACTGA CGGCCCTATG CCTCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTTG GCGTTCCTTA CATCATCGTA 100 

TTCATGAACA AATGCGACAT GGTTGATGAT GAAGAGCTGC TGGAACTGGT 150 

AGAAATGGAA GTTCGTGAAC TTCTGTCTGC TTATGACTTC CCTGGTGATG 200 

15 ACCTGCCGGT TGTTCGTGGT TCAGCGCTGA AAGCACTGGA AGGCGAAGCT 250 

GAGTGGGAAG CTAAAATCAT CGAGCTGGCC GGTCACCTGG ATTCCTACAT 3 00 

CCCAGAACCA GAGCGTGCTA TCGATCAGCC GTTCCTGCTG CCAATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TAGTTACCGG TCGTGTTGAG 400 

CGCGGTATCG TTAAAGTTGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

20 CACCGTTAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAG 550 

CGTGAAGACA TCGAACGTGG TCAGGTACTG GCTAAACCAG GTTCCATCAA 600 

GCCGCACACT CAGTTCGATT CAGAAGTGTA TATCCTGAGC AAAGAAGAAG 650 

GTGGTCGTCA TACTCCATTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 700 

25 CGTACAACTG ACGTGACCGG TAC CATCGAA CTGCCAGAAG GCGTAGAGAT 750 

GGTAATGCCA GGCGATAACG TGAACATGGT TGTTACCCTG ATCCACCCAA 800 

TCGCTATGGA CCAAGGC 817 



30 



40 



2) INFORMATION FOR SEQ ID NO: 167 



■(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 787 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia liquefaciens 

(B) STRAIN: ATCC 27592 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167 

GCTGCGACTG ACGGCCCAAT GCCTCAGACC CGTGAGCACA TCCTGCTGGG 50 

TCGTCAGGTT GGCGTTCCTT TCATCATCGT ATTCATGAAC AAATGCGACA 100 

TGGTTGATGA TGAAGAGCTG CTGGAACTGG TAGAAATGGA AGTTCGTGAA 150 

50 CTTCTGTCTG CTTACGACTT CCCTGGTGAT GACCTGCCGG TTGTTCGTGG 200 

TTCAGCGCTG AAAGCACTGG AAGGCGAAGC TGAGTGGGAA GCTAAAATCA 250 

TCGAGCTGGC CGGTTACCTG GATTCTTACA TCCCAGAACC AGAGCGTGCT 300 

ATCGACAAGC CGTTCCTGCT GCCAATCGAA GACGTCTTCT CCATCTCCGG 350 

TCGTGGTACC GTTGTTACCG GTCGTGTTGA GCGCGGTATC GTTAAAGTTG 400 

55 GCGAAGAAGT TGAAATCGTT GGTATCAAAG ACACCGTTAA GTCTACCTGT 450 

ACTGGCGTTG AAATGTTCCG CAAACTGCTG GACGAAGGCC GTGCTGGTGA 500 

GAACGTTGGT GTTCTGCTGC GTGGTATCAA GCGTGAAGAC ATCGAACGTG 550 

GTCAGGTACT GGCTAAACCA GGTTCAATCA AGCCACACAC CAAGTTCGAC 600 

TCAGAAGTGT ACATCCTGAG CAAAGAAGAA GGTGGTCGTC ATACTCCATT 650 

60 CTTCAAAGGC TACCGTCCAC AGTTCTACTT CCGTACAACT GACGTGACCG 700 

101 
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GTACCATCGA ACTGCCAGAA GGCGTTGAAA TGGTAATGCC AGGTGACAAC 
GTGAACATGG TTGTTACCCT GATCCACCCA ATCGCGA 



750 
787 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 168 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 74 5 bases 
•(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia marcescens 

(B) STRAIN: ATCC 13880 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168 

GCCTCAGACT CGTGAGCACA TCCTGCTGGG TCGTCAGGTT GGCGTTCCTT 50 

TCATCATCGT ATTCATGAAC AAATGCGACA TGGTTGATGA TGAAGAGCTG 100 

YTGGAACTGG TAGAAATGGA AGTTCGCGAA CTGCTGTCCG CTTACGACTT 150 

CCCTGGCGAC GACCTGCCGG TAATCCGCGG TTCCGCGCTG AAAGCGCTGG 200 

AAGGCGAAGC TGAGTGGGAA GCGAAAATCA TCGAACTGGC CGAAGCCCTG 2 50 

GACAGCTACA TCCCAGAGCC AGAGCGTGCT ATCGACAAGC CGTTCCTGCT 300 

GCCAATCGAA GACGTATTCT CCATCTCCGG TCGTGGTACC GTTGTTACCG 3 50 

GTCGTGTTGA GCGCGGCATC ATCAAAGTTG GCGAAGAAGT TGAAATCGTT 4 00 

GGTATCAAAG ACACCGTTAA GTCTACCTGT ACTGGCGTTG AAATGTTCCG 4 50 

CAAACTGCTG GACGAAGGCC GTGCTGGTGA GAACGTAGGT GTTCTGCTGC 500 

GTGGTATCAA ACGTGAAGAA ATCGAACGTG GTCAGGTACT GGCTAAGCCA 550 

GGCTCCATCA AGCCGCACAC CCAGTTCGAA TCTGAAGTGT ACATCCTGAG 600 

CAAAGATGAA GGTGGTCGTC ACACKCCATT CTTCAAAGGC TACCGTCCAC 650 

AGTTCTACTT CCGTACCACT GACGTGACCG GTACCATCGA ACTGCCAGAA 700 

GGCGTAGAGA TGGTAATGCC AGGCGACAAC GTGAACATGG TTGTA 745 



2) INFORMATION FOR SEQ ID NO: 169 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia odor if era 

(B) STRAIN: ATCC 33077 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169 

GGCGCAATCC TGGTTGTTGC TGCGACTGAC GGCCCTATGC CTCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GCCAGGTTGG CGTTCCTTTC ATCATCGTGT 100 

TCATGAACAA ATGTGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TTCGCGAGCT GCTGTCTGCT TACGATTTCC CTGGCGACGA 200 

CCTGCCAGTA ATCCGCGGTT CTGCGCTGAA AGCGCTGGAA GGCGAAGCAG 250 
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AGTGGGAAGC TAAGATTGTA GAACTGGCTG 
CCAGAACCAG AGCGTGCTAT CGACAAGCCG 
CGTATTCTCC ATCTCCGGTC GTGGTACCGT 
GCGGTATCAT CAAAGTTGGC GAAGAAGTTG 
5 ACCGTTAAGT CTACCTGTAC CGGTGTAGAA 
CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT 
GTGAAGACAT CGAACGTGGT CAGGTTCTGG 
CCGCACACCA AATTCGACTC AGAAGTGTAC 
TGGTCGTCAC ACGCCATTCT TCAAAGGCTA 
10 GTACTACTGA CGTGACCGGT ACCATCGAAC 
GTAATGCCAG GCGATAACGT GAACATGGTT 
CGCAATGGAC GACGGTCTGC GTTTCGCAA 



AAGCGCTGGA TTCTTACATC 300 

TTCCTGCTGC CAATCGAAGA 350 

TGTTACCGGT CGTGTTGAGC 400 

AAATCGTTGG TATCAAAGAC 450 

ATGTTCCGCA AACTGCTGGA 500 

TCTGCTGCGT GGTATCAAGC 550 

CTAAACCAGG TTCTATCAAG 600 

ATCCTGAGCA AAGAAGAAGG 650 

CCGTCCACAG TTCTACTTCC 700 

TGCCAGAAGG CGTAGAGATG 750 

GTTACCCTGA TTCACCCAAT 800 

829 



15 



2 ) INFORMATION FOR SEQ ID NO: 170 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 830 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia plymuthica 

(B) STRAIN: DSM 4540 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170 

* > 

CGGCGCAATC CTGGTTGTTG CTGCGACTGA CGGCCCAATG CCTCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTTG GCGTTCCTTT CATCATCGTA 100 

TTCATGAACA AATGCGACAT GGTTGATGAT GAAGAGCTGC TGGAACTGGT 150 

35 AGAAATGGAA GTTCGTGAAC TTCTGTCTGC TTACGACTTC CCTGGTGATG 200 

ACCTGCCGGT TGTTCGTGGT TCAGCGCTGA AAGCACTGGA AGGCGAACCA 250 

GAGTGGGAAG CTAAAATCAT CGAGCTGGCT GGTTTCCTGG ATTCTTACAT 300 

CCCAGAACCA GAGCGTGCTA TCGACAAGCC GTTCCTGCTG CCAATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTTGAG 400 

40 CGCGGTATCG TTAAAGTTGG CGAAGAAGTT GAAATCGTGG GTATCAAAGA 450 

CACCGTTAAG TCTACCTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTGGGTG TTCTGCTGCG TGGTATCAAG 550 

CGCGAAGATA TCGAACGTGG TCAGGTCCTG GCTAAACCAG GTTCAATCAA 600 

GCCACACACC AAGTTTGACT CAGAAGTGTA CATCCTGAGC AAAGAAGAAG 650 

45 GTGGTCGTCA TACTCCATTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTGACCGG TACCATCGAA CTGCCAGAAG GCGTAGAGAT 750 

GGTAATGCCA GGTGACAACG TGAACATGGT TGTAACCCTG ATCCACCCAA 800 

TCGCGATGGA CGACGGCCTG CGTTTCGCAA 830 

50 

2) INFORMATION FOR SEQ ID NO: 171 

( i ) SEQUENCE CHARACTER I STI CS : 
55 (A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

SO (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia rubidaea 

(B) STRAIN: ATCC 27593 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171 

GGCGCAATCC TGGTAGTAGC AGCGACTGAC GGCCCAATGC CTCAGACCCG 50 

TGAGCACATC CTGCTGGGCC GCCAGGTAGG CGTACCTTAC ATCATCGTAT 100 

10 TCATGAACAA ATGCGACATG GTAGATGATG AAGAGCTGCT GGAACTGGTA 150 

GAGATGGAAG TTCGCGAACT GCTGTCTGCT TACGACTTCC CAGGCGACGA 200 

CCTGCCGGTA ATCCGTGGTT CCGCGCTGAA AGCGCTGGAA GGCGAAGCCG 250 

AGTGGGAAGC GAAAATCGTT GAGCTGGCAG AAGCGCTGGA CAGCTACATC 300 

CCAGAGCCAG AGCGTGCTGT AGACAAGCCG TTCCTGCTGC CAATCGAAGA 350 

15 CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

GCGGTATCAT CAAAGTTGGT GAAGAAGTAG AAATCGTAGG TATCAAAGAC 450 

ACCGTTAAGT CTACCTGTAC TGGCGTAGAA ATGTTCCGCA AACTGC TGGA 500 

CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAGC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CGAAGCCAGG TTCAATCAAG 600 

20 CCGCACACCC AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

GTACAACTGA CGTGACCGGT ACCATCGAAC TGCCAGAAGG CGTAGAGATG 750 

GTAATGCCAG GCGACAACGT GAACATGAAA GTTACTCTGA TTCACCCAAT 800 

CGCAATGGAC GACGGTCTGC GTTTCGCAA 829 

25 



2) INFORMATION FOR SEQ ID NO: 172 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

v , (C) STRANDEDNESS : Double 
(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella boydii 
40 (B) STRAIN: ATCC 9207 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172 

CGGCGCGATC YTGGTAGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

45 GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 200 

ACACTCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

GAGTGGGAAG CGAAAATCCT GGAACTGGCT GGCTTCCTGG ATTCTTACAT 300 

50 TCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 4 00 

CGCGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

55 CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACTACTG ACGTGACTGG TACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

50 TCGCGATGGA CGACGGTCTG CGTTTC 826 
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2) INFORMATION FOR SEQ ID NO: 173 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella dysenteriae 

(B) STRAIN: ATCC 11835 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173 

TGGTAGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG TGAGCACATC 50 

CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT TCCTGAACAA 100 

ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT GAAATGGAAG 150 

TTCGTGAACT TCTGTCTCAG TACGACTTCC CGGGCGACGA CACTCCGATC 200 

GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG AGTGGGAAGC 250 

GAAAATCCTG GAACTGGCTG GCTTCCTGGA TTCYTAYATT CCGGAACCAG 300 

AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA CGTATTCTCC 350 

ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC GCGGTATCAT 400 

CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAG ACYCAGAAGT 450 

CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA CGAAGGCCGT 500 

GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC GTGAAGAAAT 550 

CGAACGTGGT CAGGTACTGG CGAAGCCRGG CACCATCAAG CCGCACACCA 600 

AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG CGGCCGTCAT 650 

ACTCCGTTCT TCAAAGGGTA CCGTCCGCAG TTCTACTTCC GTAC TACTGA 700. 

CGTGACTGGT ACCATCGAAC TGCCGGAAGG CGTAGAGATG GTAATGCCGG 750 

GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT CGCGATGGAC 800 

GACGGTCTGC GTTTCGCA 818 



2) INFORMATION FOR SEQ ID NO: 174 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella flexneri 

(B) STRAIN: ATCC 12022 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174 

CCTGGTAGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACT CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCCTGAAC 100 

AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TTGAAATGGA 150 

AGTTCGTGAA CTTCTGTCTC AGTACGACTT CCCGGGCGAC GACACTCCGA 200 

TCGTTCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGACGC AGAGTGGGAA 250 
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GCGAAAATCC 
AGAGCGTGCG 
CCATCTCCGG 
ATCAAAGTTG 
GTCTACCTGT 
GTGCTGGTGA 
ATCGAACGTG 
CAAGTTCGAA 
ATACTCCGTT 
GACGTGACTG 
GGGCGACAAC 
ACGACG 



TGGAACTGGC 
ATTGACAAGC 
TCGTGGTACC 
GTGAAGAAGT 
ACTGGCGTTG 
GAACGTAGGT 
GTCAGGTACT 
TCTGAAGTGT 
CTTCAAAGGC 
GTACCATCGA 
ATCAAAATGG 



TGGCTTCCTG 
CGTTCCTGCT 
GTTGTTACCG 
TGAAATCGTT 
AAATGTTCCG 
GTTCTGCTGC 
GGCTAAGCCG 
ACATTCTGTC 
TACCGTCCGC 
ACTGCCGGAA 
TTGTTACCCT 



GATTCTTACA TTCCGGAACC 
GCCGATCGAA GACGTATTCT 
GTCGTGTAGA ACGCGGTATC 
GGTATCAAAG AGACTCAGAA 
CAAACTGCTG GACGAAGGCC 
GTGGTATCAA ACGTGAAGAA 
GGCACCATCA AGCCGCACAC 
CAAAGATGAA GGCGGMCGTC 
AGTTCTACTT CCGTACTACT 
GGCGTAGAGA TGGTAATGCC 
GATCCACCCG ATCGCGATGG 



300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
806 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 175 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella aonnei 

(B) STRAIN: ATCC 2 9930 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175 

CGGCGCGATC CTGGTAGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATG CGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 200 

ACACTCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

GAGTGGGAAG CGAAAATCCT GGAACTGGCT GGCTTCCTGG ATTCTTACAT 300 

TCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 3 50 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 4 00 

CGCGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTAGGTG TTCTGCTGCG TGGTAT C AAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGYCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 7 00 

CGTACTACTG ACGTGACTGG TACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGGTCTG CGTTTCGCAA TC 832 



2) INFORMATION FOR SEQ ID NO: 176 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 716 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 13301 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176 

TCTGCTGCTG ACGGTCCAAT GCCACAAACT CGTGAACACA TTCTTTTATC 50 

ACGTAACGTT GGTGTACCAG CATTAGTAGT ATTCTTAAAC AAAGTTGACA 100 

10 TGGTTGACGA TGAAGAATTA TTAGAATTAG TAGAAATGGA AGTTCGTGAC 150 

TTATTAAGCG AATATGACTT CCCAGGTGAC GATGTACCTG TAATCGCTGG 2 00 

TTCAGCATTA AAAGCTTTAG AAGGCGATGC TCAATACGAA GAAAAAATCT 250 

TAGAATTAAT GGAAGCTGTA GATACTTACA TTCCAACTCC AGAACGTGAT 300 

TCTGACAAAC CATTCATGAT GCCAGTTGAG GACGTATTCT CAATCACTGG 350 

15 TCGTGGTACT GTTGCTACAG GCCGTGTTGA ACGTGGTCAA ATCAAAGTTG 400 

GTGAAGAAGT TGAAATCATC GGTTTACATG ACACATCTAA AACAACTGTT 450 

ACAGGTGTTG AAATGTTCCG TAAATTATTA GACTACGCTG AAGCTGGTGA 500 

CAACATTGGT GCATTATTAC GTGGTGTTGC TCGTGAAGAC GTACAACGTG 550 

GTCAAGTATT AGCTGCTCCT GGTTCAATTA CACCACATAC TGAATTCAAA 600 

20 GCAGAAGTAT ACGTATTATC AAAAGACGAA GGTGGACGTC ACACTCCATT 650 

CTTCTCAAAC TATCGTCCAC AATTCTATTT CCGTACTACT GACGTAACTG 700 

GTGTTGTTCA CTTACC 716 



25 



35 



2) INFORMATION FOR SEQ ID NO: 177 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 719 bases 
30 (B) v TYPE: Nucleic acid 

(C) :STRANDEDNESS : Double 

(D) TOPOLOGY: Linear- 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 29247 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 177 

TTCTTTTATC ACGTAACGTT GGTGTACCAG CATTAGTAGT ATTCTTAAAC 50 

AAAGTTGACA TGGTTGACGA TGAAGAATTA TTAGAATTAG TAGAAATGGA 100 

AGTTCGTGAC TTATTAAGCG AATATGACTT CCCAGGTGAC GATGTACCTG 150 

45 TAATCGCTGG TTCAGCATTA AAAGCTTTAG AAGGCGATGC TCAATACGAA 200 

GAAAAAATCT TAGAATTAAT GGAAGCTGTA GATACTTACA TTCCAACTCC 250 

AGAACGTGAT TCTGACAAAC CATTCATGAT GCCAGTTGAG GACGTATTCT 300 

CAATCACTGG TCGTGGTACT GTTGCTACAG GCCGTGTTGA ACGTGGTCAA 350 

ATCAAAGTTG GTGAAGAAGT TGAAATCATC GGTTTACATG ACACATCTAA 400 

50 AACAACTGTT ACAGGTGTTG AAATGTTCCG TAAATTATTA GACTACGCTG 450 

AAGCTGGTGA CAACATTGGT GCATTATTAC GTGGTGTTGC TCGTGAAGAC 500 

GTACAACGTG GTCAAGTATT AGCTGCTCCT GGTTCAATTA CACCACATAC 550 

TGAATTCAAA GCAGAAGTAT ACGTATTATC AAAAGACGAA GGTGGACGTC 600 

ACACTCCATT CTTCTCAAAC TATCGTCCAC AATTCTATTT CCGTACTACT 650 

55 GACGTAACTG GTGTTGTTCA CTTACCAGAA GGTMCTGAAA TGGTAATGCC 7 00 

TGGTGATAAC GTTGAAATG 719 



60 2) INFORMATION FOR SEQ ID NO: 178 
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( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 625 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 33591 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178 



15 



GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC ATTAGTAGTA 50 

TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 100 

AGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGACG 150 

ATGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGATGCT 200 

20 CAATACGAAG AAAAAATCTT AGAATTAATG GAAGCTGTAG ATACTTACAT 250 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 300 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 350 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTTTACATGA 400 

CACATCTAAA ACAACTGTTA CAGGTGTTGA AATGTTCCGT AAATTATTAG 450 

25 ACTACGCTGA AGCTGGTGAC AACATTGGTG CATTATTACG TGGTGTTGCT 500 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCAATTAC 550 

ACCACATACT GAATTCAAAG CAGAAGTATA CGTATTATCA AAAGACGAAG 600 

GTGGACGTCA CACTCCATTC TTCTC 625 

30 

2 ) INFORMATION FOR SEQ ID NO:- 179 

(i) SEQUENCE CHARACTERISTICS : 
35 (A) LENGTH: 704 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 43300 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179 



45 



GTTGGTGTAC CAGCATTAGT AGTATTCTTA AACAAAGTTG ACATGGTTGA 50 

CGATGAAGAA TTATTAGAAT TAGTAGAAAT GGAAGTTCGT GACTTATTAA 100 

50 GCGAATATGA CTTCCCAGGT GACGATGTAC CTGTAATCGC TGGTTCAGCA 150 

TTAAAAGCTT TAGAAGGCGA TGCTCAATAC GAAGAAAAAA TCTTAGAATT 200 

AATGGAAGCT GTAGATACTT ACATTCCAAC TCCAGAACGT GATTCTGACA 250 

AACCATTCAT GATGCCAGTT GAGGACGTAT TCTCAATCAC TGGTCGTGGT 300 

ACTGTTGCTA CAGGCCGTGT TGAACGTGGT CAAATCAAAG TTGGTGAAGA 350 

55 AGTTGAAATC ATCGGTTTAC ATGACACATC TAAAACAACT GTTACAGGTG 400 

TTGAAATGTT CCGTAAATTA TTAGACTACG CTGAAGCTGG TGACAACATT 450 

GGTGCATTAT TACGTGGTGT TGCTCGTGAA GACGTACAAC GTGGTCAAGT 500 

ATTAGCTGCT CCTGGTTCAA TTACACCACA TACTGAATTC AAAGCAGAAG 550 

TATACGTATT ATCAAAAGAC GAAGGTGGAC GTCACACTCC ATTCTTCTCA 600 

60 AACTATCGTC CACAATTCTA TTTCCGTACT ACTGACGTAA CTGGTGTTGT 650 
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TCACTTACCA GAAGGTACTG AAATGGTAAT GCCTGGTGAT AACGTTGAAA 
TGAC 



700 
704 
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20 
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2) INFORMATION FOR SEQ ID NO: 180 

( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 73 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus subsp, aureus 

(B) STRAIN: ATCC 6538 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180 

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC ATTAGTAGTA 50 

TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 100 

AGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGACG 150 

ATGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGATGCT 200 

CAATACGAAG AAAAAATCTT AGAATTAATG GAAGCTGTAG ATACTTACAT 250 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 300 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 350 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTTTACATGA 400 

CACATCTAAA ACAACTGTTA CAGGTGTTGA AATGTTCCGT AAATTATTAG 4 50 

ACTACGCTGA AGCTGGTGAC AACATTGGTG CATTATTACG TGGTGTTGCT 500 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCAATTAC 550 

ACCACATACT GAATTCAAAG CAGAAGTATA CGTATTATCA AAAGACGAAG 600 

GTGGACGTCA CACTCCATTC TTCTCAAACT ATCGTCCACA ATTCTATTTC 650 

CGTACTACTG ACGTAACTGG TGTTGTTCAC TTACCAGAAG GTAC TGAAAT 700 

GGTAATGCCT GGTGATAACG TTGAAATGAC 730 



2) INFORMATION FOR SEQ ID NO: 181 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus auricular is 

(B) STRAIN: ATCC 33753 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181 

CGGTGCGATC TTAGTTGTAT CTGCAGCTGA TGGTCCAATG CCACAAACTC 50 

GTGAACACAT CTTATTATCA CGTAACGTTG GTGTACCAGC ATTAGTTGTA 100 

TTCTTAAACA AAGTTGACCA AGTTGACGAC GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGACT TATTAAGCGA ATACGACTAC CCAGGTGACG 200 

ATGTACCTGT AATCTCTGGT TCTGCGTTGA AAGCATTAGA AGGCGACAAA 250 
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GAATACGAAC 
TCCAACTCCA 
ACGTATTCTC 
CGTGGTCAAA 
CGGTTCACAA 
TAGACTACGC 
TCACGTGAAG 
TACACCACAC 
AAGGTGGACG 
TTCCGTACTA 
AATGGTAATG 
CAATCGCTAT 



AAAAAATCTT 
GAACGTGACT 
AATCACTGGT 
TCAAAGTCGG 
AAAACAACAG 
TGAAGCTGGT 
AAGTACAACG 
ACTAAATTCA 
TCACACTCCA 
CTGACGTAAC 
CCTGGCGATA 
CGAAGACGGT 



AGACTTAATG 
CTGATAAACC 
CGTGGTACTG 
TGAAGAAGTT 
TTACTGGTGT 
GACAACATCG 
TGGTCAAGTT 
CTGCAGAAGT 
TTCTTCTCTA 
AGGTGTTGTT 
ACGTTAAAAT 
ACTCGTTTCT 



CAACAAGTTG ACGATTACAT 
ATTCATGATG CCAGTTGAAG 
TTGCAACAGG CCGTGTTGAA 
GAAATCATCG GTATGAAAGA 
AGAAATGTTC CGTAAATTAT 
GTGCTTTATT ACGTGGTATT 
TTAGCTGCTC CTGGTTCAAT 
TTACGTATTA TCTAAAGATG 
ACTACCGTCC ACAATTCTAT 
ACTTTACCAG AAGGTACAGA 
GGAAGTTGAA TTAATTTCTC 
CAAT 



300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
834 



10 



15 



20 



25 



30 



35 



40 
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2) INFORMATION FOR SEQ ID NO: 182 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus capitis subsp. capitis 

(B) STRAIN: ATCC 27840 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182 

CGGCGGTATC TTAGTAGTAT CTGCTGCTGA CGGTCCAATG CCACAAACTC 50 

GTGAACACAT CTTATTATCA CGTAACGTTG GTGTACCAGC ATTAGTTGTA 100 

TTC TTAAACA AAGTTGACAT GGTAGACGAC GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGATG 200 

ATGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGATGCT 250 

CAATACGAAG AAAAAATCTT AGAATTAATG CAAGCAGTTG ATGATTACAT 300 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTATCCACGA 450 

AACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAATTATTAG 500 

AC TACGCTGA AGCTGGTGAC AACATCGGTG CTTTATTACG TGGTGTTGCT 550 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCAATCAC 600 

ACCACACACT AAATTCAAAG CGGAAGTTTA CGTTTTATCT AAAGACGAAG 650 

GTGGACGTCA CACTCCATTC TTCAGTAACT ACCGCCCACA ATTCTATTTC 700 

CGTACTACTG ACGTAACTGG TGTTGTTAAC TTACCAGAAG GTACTGAAAT 750 

GGTTATGCCT GGCGACAACG TTGAAATGAC AGTTGAATTA ATCGCTCCTA 800 

TCGCTATTGA AGACGGTACT CGTTTCTCAA TCGGA 835 



2) INFORMATION FOR SEQ ID NO: 183 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 804 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Macrococcus caseolyticus 

(B) STRAIN: ATCC 13548 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183 



GTATCTTAGT 
CACATCCTTT 
GAACAAAGTT 
TGGAAGTTCG 
CCTGTAATCG 
CGAAGACAAA 
CTCCAGAACG 
TTCTCAATCA 
ACAAGTTAAA 
CAGCAAAAAC 
TACGCTGAAG 
TGAAGACGTA 
CACATACTAA 
GGACGTCATA 
TACAACTGAC 
TAATGCCTGG 
GCGA 



AGTATCTGCT 
TATCACGTAA 
GACATGGTTG 
TGACTTATTA 
CTGGATCTGC 
ATCATGGAAT 
TGATTCTGAC 
CTGGTCGTGG 
GTTGGTGAAG 
TACAGTTACA 
CTGGAGATAA 
CAACGTGGAC 
ATTCAAAGCT 
CTCCATTCTT 
GTAACTGGTG 
AGATAACATC 



GCTGACGGTC 
CGTTGGTGTA 
ACGATGAAGA 
TCTGAATATG 
TTTAAAAGCA 
TAATGGACGC 
AAACCATTCA 
TACAGTTGCA 
AAGTTGAAAT 
GGTGTAGAAA 
CATCGGTGCT 
AAGTATTAGC 
GAAGTTTACG 
CACTAACTAC 
TAGTTAACTT 
GAAATGAACG 



CAATGCCACA AACTCGTGAA 
CCAGCATTAG TAGTATTCTT 
ATTATTAGAA TTAGTTGAAA 
ACTTCCCTGG TGACGATGTA 
TTAGAAGGCG TTGAAGAATA 
AGTTGATGAG TACATCCCAA 
TGATGCCAGT TGAGGACGTA 
ACTGGACGTG TTGAGCGTGG 
CATTGGTTTA ACTGAAGAAC 
TGTTCCGTAA ATTATTAGAT 
TTATTACGTG GTGTTTCTCG 
TAAACCAGGT TCAATTACTC 
TATTATCTAA AGAAGAAGGT 
CGCCCTCAGT TCTACTTCCG 
ACCAGAAGGT ACTGAAATGG 
TTGAATTAAT TTCTCCAATC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
804 



2) INFORMATION FOR SEQ ID NO: 184 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:- 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDE'DNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus cohnii 

(B) STRAIN: DSM 20260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184 

CGGAGCTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCACAAACTC 50 

GTGAACATAT CCTTTTATCA CGTAACGTTG GTGTTCCAGC ATTAGTTGTA 100 

TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGACG 200 

ATGTACCTGT AATCTCTGGT TCAGCATTAA AAGCTCTTGA AGGCGACGCT 250 

GACTATGAGC AAAAAATCTT AGACTTAATG CAAGCTGTTG ATGACTTCAT 300 

TCCAACACCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG GCGTGTTGAA 400 

CGTGGTCAAA TCAAAGTCGG TGAAGAAGTT GAAATCATCG GTATGCAAGA 450 

AGATTCAAGC AAAACAACTG TTACTGGTGT AGAAATGTTC CGTAAATTAT 500 

TAGACTACGC TGAAGCTGGT GACAACATTG GTGCGTTATT ACGTGGTGTT 550 

GCACGTGAAG ACATCCAACG TGGTCAAGTT TTAGCTGCTC CTGGTTCAAT 600 

TACACCACAC ACAAACTTTA AAGCGGAAGT TTACGTTTTA TCAAAAGATG 650 

AAGGTGGCCG TCATACGCCA TTCTTCAGTA ACTATCGCCC ACAATTCTAT 700 

TTCCGTACTA CTGACGTAAC AGGTGTTGTT ACTTTACCAG AAGGTACTGA 750 

AATGGTTATG CCTGGCGACA ACGTAGAAAT GGAAGTTGAA CTAATTTCTC 800 

CAATCGCTAT CGAAGACGGT ACACGTTTCT CT 832 
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2) INFORMATION FOR SEQ ID NO: 185 

( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 699 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: CSG 269 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185 

ATCTGCTGCT GACGGTCCAA TGCCACAAAC TCGTGAACAC ATCTTATTAT 50 

CACGTAACGT TGGTGTACCA GCATTAGTTG TATTCTTAAA CAAAGTTGAC 100 

ATGGTAGACG ACGAAGAATT ATTAGAATTA GTTGAAATGG AAGTTCGTGA 150 

CTTATTAAGC GAATATGACT TCCCAGGTGA CGATGTACCT GTAATCGCTG 200 

GTTCTGCATT AAAAGCATTA GAAGGCGATG CTGAATACGA ACAAAAAATC 250 

TTAGACTTAA TGCAAGCAGT TGATGATTAC ATTCCAACTC CAGAACGTGA 300 

TTCTGACAAA CCATTCATGA TGCCAGTTGA GGACGTATTC TCAATCACTG 350 

GTCGTGGTAC TGTTGCTACA GGCCGTGTTG AACGTGGTCA AAT CAAAGTT 400 

GGTGAAGAAG TTGAAATCAT. CGGTATGCAC GAAACTTCTA AAACAACTGT 450 

TACTGGTGTA GAAATGTTCC GTAAATTATT AGACTACGCT GAAGCTGGTG 500 

ACAACATCGG TGCTTTATTA CGTGGTGTTG CACGTGAAGA CGTACAACGT 550 

GGTCAAGTAT TAGCTGCTCC* TGGTTCTATT ACACCACACA CAAAATTCAA 600 

AGCTGAAGTA TACGTATTAT; CTAAAGATGA AGGTGGACGT CACACTCCAT 650 

TCTTCACTAA CTATCGCCCA' CAATTCTATT TCCGTACTAC TGACGTAAC 699 



2) INFORMATION FOR SEQ ID NO: 186 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: ATCC 2 9970 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186 

CGGCGGTATC TTAGTAGTAT CTGCTGCTGA CGGTCCAATG CCACAAACTC 50 

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC ATTAGTAGTA 100 

TTCTTAAATA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTACGTGACT TATTATCTGA ATACGACTTC CCAGGTGACG 200 

ATGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGATGCT 250 

CAATACGAAG AAAAAATCTT AGAATTAATG CAAGCAGTTG ATGACTACAT 300 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400 
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CGTGGGCAAA TCAAAGTTGG TGAAGAAGTT 
CACTTCTAAA ACAACTGTTA CTGGTGTAGA 

ACTACGCTGA AGCTGGTGAC AACATCGGTG 

CGTGAAGACG TACAACGTGG TCAAGTATTA 

5 ACCTCACACA AAATTTAAAG CAGACGTATA 

GTGGACGTCA CACTCCATTC TTCACAAACT 

CGTACTACTG ACGTAACTGG TGTTGTTAAC 

GGTTATGCCT GGCGACAACG TTGAAATGAC 

TCGCGATTGA AGACGGTACT CGTTTCTCA 



GAAATCATTG fcittCTCraTSR* 4#0' 

AATGTTCCGT AAATTATTAG 500 

CATTATTACG TGGTGTTGCT 550 

GCTGCTCCAG GTTCAATCAC 600 

CGTTTTATCT AAAGACGAAG 650 

ATCGTCCACA ATTCTATTTC 7 00 

TTACCAGAAG GTACTGAAAT 750 

AGTAGAATTA ATCGCTCCTA 800 

829 



2) INFORMATION FOR SEQ ID NO: 187 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 705 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 
25 (B) STRAIN: CSG 123 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187 

CACAAACTCG TGAACACATT CTTTTATCAC GTAACGTTGG TGTACCAGCT 50 

30' TTAGTTGTAT TCTTAAACAA AGTTGATATG GTAGACGACG AAGAATTATT 100 

AGAATTAGTA GAAATGGAAG TTCGTGACTT ATTATCTGAA TATGACTTCC 150 

CAGGTGACGA CGTACCTGTA ATCGCTGGTT CAGCATTAAA AGCTTTAGAA 200 

GGCGACGAAA AATACGAAGA AAAAATCTTA GAATTAATGC ' AAGCAGTTGA 250 

TGACTACATT CCAACTCCAG AACGTGATTC TGACAAACCA TTCATGATGC 300 

35 CAGTTGAGGA CGTATTCTCA ATCACTGGTC GTGGTACTGT TGCTACAGGC 350 

CGTGTTGAAC GTGGTCAAAT CAAAGTTGGT GAAGAAGTTG AAATCATCGG 400 

TTTACATGAC ACTTCTAAAA CAACTGTTAC TGGTGTAGAA ATGTTCCGTA 450 

AGTTATTAGA CTACGCTGAA GCTGGTGACA ACATCGGTGC TTTATTACGT 500 

GGTGTTGCTC GTGAAGACGT ACAACGTGGT CAAGTATTAG CTGCTCCTGG 550 

40 TTCAATTACA CCACATACAA AATTCAAAGC GGAAGTTTAC GTTTTATCTA 600 

AAGACGAAGG TGGACGTCAC ACTCCATTCT TCAGTAACTA CCGCCCACAA 650 

TTCTATTTCC GTACTACTGA CGTAACTGGC GTTGTTCAAT TACCAGAAGG 700 

TACTG 705 



2) INFORMATION FOR SEQ ID NO: 188 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 678 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE : Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: CSG 23 

60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188; 

TTTTATCACG TAACGTTGGT GTACCAGCAT TAGTAGTATT CTTAAACAAA 50 

GTTGACATGG TTGACGATGA AGAATTATTA GAATTAGTTG AAATGGAAGT 100 

5 ACGTGACTTA TTATCTGAAT ACGACTTCCC AGGTGACGAC GTACCTGTAA 150 

TCGCTGGTTC AGCTTTAAAA GCTTTAGAAG GCGATGCTCA ATACGAAGAA 200 

AAAATCTTAG AATTAATGCA AGCAGTTGAT GATTACATTC CAACTCCAGA 250 

ACGTGACTCT G AT AAAC CAT TCATGATGCC AGTTGAGGAC GTATTCTCAA 300 

TCACTGGTCG TGGTACTGTT GCTACAGGTC GTGTTGAACG TGGTCAAATC 350 

10 AAAGTTGGTG AAGAAGTTGA AATTATTGGT ATCAAAGAAA CTTCTAAAAC 4 00 

AACTGTTACT GGTGTAGAAA TGTTCCGTAA ATTATTAGAC TACGCTGAAG 450 

CTGGTGACAA CATCGGTGCT TTATTACGTG GTGTTGCTCG TGAAGATGTA 500 

CAACGTGGTC AAGTATTAGC TGCTCCAGGT TCAATTACAC CTCACACAAA 550 

ATTCAAAGCA GACGTATACG TTTTATCAAA AGATGAAGGT GGACGTCATA 600 

15 CTCCATTCTT CACTAACTAT CGTCCACAAT TCTATTTCCG TACTACTGAC 650 

GTAACTGGTG TTGTTAACTT ACCAGAAG 678 



20 2) INFORMATION FOR SEQ ID NO: 189 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30' (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus > haemolyticus 

(B) STRAIN: CSG 33 



B5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189 



ACCAGCATTA GTAGTATTCT TAAATAAAGT TGACATGGTT GACGATGAAG 50 

AATTATTAGA ATTAGTTGAA ATGGAAGTAC GTGACTTATT ATCTGAATAC 100 

GACTTCCCAG GTGACGATGT ACCTGTAATC GCTGGTTCAG CATTAAAAGC 150 

TTTAGAAGGC GATGCTCAAT ACGAAGAAAA AATCTTAGAA TTAATGCAAG 200 

10 CAGTTGATGA CTACATTCCA ACTCCAGAAC GTGATTCTGA CAAACCATTC 250 

ATGATGCCAG TTGAGGACGT ATTCTCAATC ACTGGTCGTG GTACTGTTGC 300 

TACAGGCCGT GTTGAACGTG GTCAAATCAA AGTTGGTGAA GAAGTTGAAA 350 

TCATTGGTAT CCATGACACT TCTAAAACAA CTGTTACTGG TGTAGAAATG 400 

TTCCGTAAAT TATTAGACTA CGCTGAAGCT GGTGACAACA TCGGTGCATT 4 50 

15 ATTACGTGGT GTTGCTCGTG AAGACGTACA ACGTGGTCAA GTATTAGCTG 500 

CTCCAGGTTC AATCACACCT CACACAAAAT TTAAAGCAGA CGTATACGTT 550 

TTATCTAAAG ACGAAGGTGG ACGTCACACT CCATTCTTCA CAAACTATCG 600 

TCCACAATTC TATTTCCGTA CTACTGACGT AACTGGTGTT GTTAACTTAC 650 

CAGAAGGTAC TGAAATGG 668 

50 



2) INFORMATION FOR SEQ ID NO: 190 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 
5 (B) STRAIN: CSG 8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190 

AAAGTTGACA TGGTTGACGA TGAAGAATTA TTAGAATTAG TTGAAATGGA 50 

10 AGTACGTGAC TTATTATCTG AATACGACTT CCCAGGTGAC GATGTACCTG 100 • 

TAATCGCTGG TTCAGCATTA AAAGCTTTAG AAGGCGATGC TCAATACGAA 150 

GAAAAAATCT TAGAATTAAT GCAAGCAGTT GATGATTACA TTCCAACTCC 200 

AGAACGTGAT TCTGACAAAC CATTCATGAT GCCAGTTGAG GACGTATTCT 250 

CAATCACTGG TCGTGGTACT GTTGCTACAG GCCGTGTTGA ACGTGGTCAA 300 

15 ATCAAAGTTG GTGAAGAAGT TGAAAT CATT GGTATCCATG ACACTTCTAA 350 

AACAACTGTT ACTGGTGTAG AAATGTTCCG TAAATTATTA GACTACGCTG 400 

AAGCTGGTGA CAACATTGGT GCATTATTAC GTGGTGTTGC TCGTGAAGAC 450 

GTACAACGTG GTCAAGTATT AGCTGCTCCA GGTTCAATCA CACCTCACAC 500 

AAAATTTAAA GCAGACGTAT ACGTTTTATC TAAAGACGAA GGTGGACGTC 550 

20 ACACTCCATT CTTCACAAAC TATCGTCCAC AATTCTATTT CCG 593 



2) INFORMATION FOR SEQ ID NO: 191 



25 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 828 bases 

(B) - TYPE: Nucleic acid 

(C) . STRANDEDNESS : Double 
30 (D) . TOPOLOGY: Linear. 

(ii) MOLECULE TYPE: Genomic DNA 

i. 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Staphylococcus hominis subsp. hominis 

(B) STRAIN: ATCC 27844 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191 

40 CGGCGCTATC TTAGTAGTAT CTGCTGCTGA TGGTCCAATG CCACAAACTC 50 

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC ATTAGTAGTA 100 
TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT ^ 150 

TGAAATGGAA GTACGTGACT TATTATCTGA ATACGACTTC CCAGGTGACG 2 00 

ACGTACCTGT AATCGCTGGT TCAGCTTTAA AAGCTTTAGA AGGCGATGCT 250 

45 CAATACGAAG AAAAAATCTT AGAATTAATG CAAGCAGTTG ATGATTATAT 300 

TCCAACTCCA GAACGTGACT CTGATAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATTATTG GTATCAAAGA 450 

AACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAATTATTAG 500 

50 ACTACGCTGA AGCTGGTGAC AACATCGGTG CTTTATTACG TGGTGTTGCT 550 

CGTGAAGATG TACAACGTGG TCAAGTATTA GCTGCTCCAG GTTCAATTAC 600 

ACCTCACACA AAATTCAAAG CAGACGTATA CGTTTTATCA AAAGATGAAG 650 

GTGGACGTCA TACTCCATTC TTCTCTAACT ATCGTCCACA ATTCTATTTC 700 

CGTACTACTG ACGTAACTGG TGTTGTTAAC TTACCAGAAG GTACTGAAAT 750 

55 GGTAATGCCT GGTGACAACG TTGAAATGAC AGTAGAATTA ATCGCTCCTA 800 

TCGCGATTGA AGACGGTACT CGTTTCTC 828 



60 2) INFORMATION FOR SEQ ID NO: 192 
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( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 620 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

<D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 

(B) STRAIN: ATCC 35982 



15 



30 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192 



ATGGTCCAAT GCCACAAACT CGTGAACACA TTCTTTTATC ACGTAACGTT 50 

GGTGTACCAG CTTTAGTTGT ATTCTTAAAC AAAGTTGATA TGGTAGACGA 100 

CGAAGAATTA TTAGAATTAG TAGAAATGGA AGTTCGTGAC TTATTATCTG 150 

AATATGACTT CCCAGGTGAC GACGTACCTG TAATCGCTGG TTCAGCATTA 200 

20 AAAGCTTTAG AAGGCGACGA AAAATACGAA GAAAAAATCT TAGAATTAAT 250 

GCAAGCAGTT GATGACTACA TTCCAACTCC AGAACGTGAT TCTGACAAAC 300 

CATTCATGAT GCCAGTTGAG GACGTATTCT CAATCACTGG TCGTGGTACT 350 

GTTGCTACAG GCCGTGTTGA ACGTGGTCAA ATCAAAGTTG GTGAAGAAGT 400 

TGAAATCATC GGTTTACATG ACACTTCTAA AACAACTGTT ACTGGTGTAG 450 

25 AAATGTTCCG TAAGTTATTA GACTACGCTG AAGCTGGTGA CAACATCGGT 500 

GCTTTATTAC GTGGTGTTGC TCGTGAAGAC GTACAACGTG GTCAAGTATT 550 

AGCTGCTCCT GGTTCAATTA CACCACATAC AAAATTCAAA GCGGAAGTTT 600 

ACGTTTTATC TAAAGACGAA 62 0 



2) INFORMATION FOR SEQ ID NO: 193 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 692 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 

(B) STRAIN: CSG 170 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193 



CCAGCATTAG TAGTATTCTT AAACAAAGTT GACATGGTTG ACGATGAAGA 50 

ATTATTAGAA TTAGTTGAAA TGGAAGTACG TGACTTATTA TCTGAATACG 100 

50 ACTTCCCAGG TGACGACGTA CCTGTAATCG CTGGTTCAGC TTTAAAAGCT 150 

TTAGAAGGCG ATGCTCAATA CGAAGAAAAA ATCTTAGAAT TAATGCAAGC 200 

AGTTGATGAT TATATTCCAA CTCCAGAACG TGACTCTGAT AAACCATTCA 250 

TGATGCCAGT TGAGGACGTA TTCTCAATCA CTGGTCGTGG TACTGTTGCT 300 

ACAGGCCGTG TTGAACGTGG TCAAATCAAA GTTGGTGAAG AAGTTGAAAT 350 

55 TATTGGTATC AAAGAAACTT CTAAAACAAC TGTTACTGGT GTAGAAATGT 400 

TCCGTAAATT ATTAGACTAC GCTGAAGCTG GTGACAACAT CGGTGCTTTA 450 

TTACGTGGTG TTGCTCGTGA AGATGTACAA CGTGGTCAAG TATTAGCTGC 500 

TCCAGGTTCA ATTACACCTC ACACAAAATT CAAAGCAGAC GTATACGTTT 550 

TATCAAAAGA TGAAGGTGGA CGTCATACTC CATTCTTCTC TAACTATCGT 600 

60 CCACAATTCT ATTTCCGTAC TACTGACGTA ACTGGTGTTG TTAACTTACC 650 
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5 2) INFORMATION FOR SEQ ID NO: 194 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 684 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 

(B) STRAIN: CSG 36 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194 

20 

CATTCTTTTA TCACGTAACG TTGGTGTACC AGCATTAGTA GTATTCTTAA 50 

ACAAAGTTGA CATGGTTGAC GATGAAGAAT TATTAGAATT AGTTGAAATG 100 

GAAGTACGTG AC TTATTAT C TGAATACGAC TTCCCAGGTG ACGACGTACC 150 

TGTAATCGCT GGTTCAGCTT TAAAAGCTTT AGAAGGCGAT GCTCAATACG 200 

25 AAGAAAAAAT CTTAGAATTA ATGCAAGCAG TTGATGATTA TATTCCAACT 250 

CCAGAACGTG ACTCTGATAA ACCATTCATG ATGCCAGTTG AGGACGTATT 3 00 

CTCAATCACT GGTCGTGGTA CTGTTGCTAC AGGCCGTGTT GAACGTGGTC 350 

AAATCAAAGT TGGTGAAGAA GTTGAAATTA TTGGTATCAA AGAAACTTCT 400 

AAAACAACTG TTACTGGTGT AGAAATGTTC CGTAAATTAT TAGACTACGC 450 

30 TGAAGCTGGT GACAACATCG GTGCTTTATT ACGTGGTGTT^ GCTCGTGAAG 500 

ATGTACAACG TGGTCAAGTA TTAGCTGCTC CAGGTTCAAT TACACCTCAC 550; 

ACAAAATTCA AAGCAGACGT ATACGTTTTA TCAAAAGATG AAGGTGGACG 600 : 

TCATACTCCA TTCTTCTCTA ACTATCGTCC ACAATTCTAT TTCCGTACTA 650 

CTGACGTAAC TGGTGTTGTT AACTTACCAG AAGG 684 

35 



2) INFORMATION FOR SEQ ID NO: 195 



40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 685 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 
50 (B) STRAIN: CSG 6 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195 



ACCAGCATTA GTAGTATTCT TAAACAAAGT TGACATGGTT GACGATGAAG 50 

55 AATTATTAGA ATTAGTTGAA ATGGAAGTAC GTGACTTATT ATCTGAATAC 100 

GACTTCCCAG GTGACGACGT ACCTGTAATC GCTGGTTCAG CTTTAAAAGC 150 

TTTAGAAGGC GATGCTCAAT ACGAAGAAAA AATCTTAGAA TTAATGCAAG 2 00 

CAGTTGATGA TTACATTCCA ACTCCAGAAC GTGACTCTGA TAAACCATTC 250 

ATGATGCCAG TTGAGGACGT ATTCTCAATC ACTGGTCGTG GTACTGTTGC 300 

50 TACAGGCCGT GTTGAACGTG GTCAAATCAA AGTTGGTGAA GAAGTTGAAA 350 
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TTATTGGTAT CAAAGAAACT 
TTCCGTAAAT TATTAGACTA 
ATTACGTGGT GTTGCTCGTG 
CTCCAGGTTC AATTACACCT 
TTATCAAAAG ATGAAGGTGG 
TCCACAATTC TATTTCCGTA 
CAGAAGGTAC TGAAATGGTA 



TCTAAAACAA 
CGCTGAAGCT 
AAGATGTACA 
CACACAAAAT 
ACGTCATACT 
CTACTGACGT 
ATGCCTGGCG 



CTGTTACTGG TGTAGAAATG 
GGTGACAACA TCGGTGCTTT 
ACGTGGTCAA GTATTAGCTG 
TCAAAGCAGA CGTATACGTT 
CCATTCTTCA CTAACTATCG 
AACTGGTGTT GTTAACTTAC 
ACAAC 



400 
450 
500 
550 
600 
650 
685 



LO 



L5 



20 



IS 



30 



35 



10 



15 



50 



55 



2) INFORMATION FOR SEQ ID NO: 196 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 611 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 

(B) STRAIN: CSG 62 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196 

GACTTATTAT CTGAATACGA CTTCCCAGGT GACGACGTAC CTGTAATCGC 50 

TGGTTCAGCT TTAAAAGCTT TAGAAGGCGA TGCTCAATAC GAAGAAAAAA 100 

TCTTAGAATT AATGCAAGCA GTTGATGATT ACATTCCAAC TCCAGAACGT 150 

GACT CTGATA AACCATTCAT GATGCCAGTT GAGGACGTAT TCTCAATCAC 200 

TGGTCGTGGT ACTGTTGCTA CAGGCCGTGT TGAACGTGGT CAAATCAAAG 250 

TTGGTGAAGA AGTTGAAATT ATTGGTATCA AAGATACTTC TAAAACAACT 300 

GTTACTGGTG TAGAAATGTT CCGTAAATTA TTAGACTACG CTGAAGCTGG 350 

TGACAACATC GGTGCTTTAT TACGTGGTGT TGCTCGTGAA GATGTACAAC 400 

GTGGTCAAGT ATTAGCTGCT CCAGGTTCAA TCACACCTCA CACAAAATTC 450 

AAAGCAGACG TATATGTTTT ATCAAAAGAT GAAGGTGGAC GTCATACTCC 500 

ATTCTTCACT AACTATCGTC CACAATTCTA TTTCCGTACT ACTGACGTAA 550 

CTGGTGTTGT TAACTTACCA GAAGGTACTG AAATGGTAAT GCCTGGCGAC 600 

AACGTTGAAA T 611 



2) INFORMATION FOR SEQ ID NO: 197 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus lugdunensis 

(B) STRAIN: ATCC 43809 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197 

CGGCGGTATC TTAGTAGTTT CTGCTGCAGA TGGTCCAATG CCACAAACTC 50 
GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTGCCAGC ATTAGTAGTA 100 
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TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGATT TATTAACTGA ATATGACTTC CCAGGTGACG 200 

ATGTGCCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGACGAA 250 

AAATACGAAG CTAAAATCTT AGAATTAATG GATGCAGTTG ATAACTACAT 300 

5 TCCAACTCCA GAACGTGACT CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 4 00 

CGTGGTCAAA TCAAAGTCGG TGAAGAAGTT GAAATTATTG GTATCCACGA 450 

TACTACTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAATTATTAG 500 

ACTACGCTGA AGCTGGTGAC AACATCGGTG CGTTATTACG TGGTGTTGCT 550 

10 CGTGAAGATG TACAACGTGG ACAAGTATTA GCTGCTCCAG GTTCAATTAC 600 

ACCTCACACT AAATTTAAAG CTGACGTATA TGTTTTATCT AAAGATGAAG 650 

GTGGACGTCA TACACCATTC TTCTCAAACT ACCGCCCACA ATTCTATTTC 700 

CGTACTACAG ACGTAACTGG TGTTGTTAAC TTACCAGAAG GTACAGAAAT 750 

GGTTATGCCT GGCGACAACG TTGAAATGAC AGTTGAATTA ATCGCTCCAA 800 

15 TCGCTATCGA AGACGGAACT CGTTTCTC 828 



20 



50 



2) INFORMATION FOR SEQ ID NO: 198 



( i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 690 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: ATCC .35552 

* ■ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198 

35 AGTAGTATCT GCTGCTGATG GCCCAATGCC ACAAACTCGT GAACACATTC 50 

TTTTATCACG TAACGTTGGT GTTCCAGCAT TAGTTGTATT CTTAAACAAA 100 

GTTGACATGG TTGACGATGA AGAATTATTA GAATTAGTAG AAATGGAAGT 150 

TCGTGACTTA TTAAGCGAAT ATGACTTCCC AGGTGACGAT GTACCTGTAA 2 00 

TCTCTGGTTC TGCATTAAAA GCTTTAGAAG GCGACGCTGA CTATGAGCAA 250 

40 AAAATCTTAG ACTTAATGCA AGCTGTTGAT GACTTCATTC CAACACCAGA 3 00 

ACGTGATTCT GACAAACCAT TCATGATGCC AGTTGAGGAC GTATTCTCAA 350 

TCACTGGTCG TGGTACTGTT GCTACAGGCC GTGTTGAACG TGGTCAAATC 4 00 

AAAGTCGGTG AAGAAATCGA AATCATCGGT ATGCAAGAAG AATCAAGCAA 4 50 

AACAACTGTT ACTGGTGTAG AAATGTTCCG TAAATTATTA GACTACGCTG 500 

45 AAGCTGGTGA CAACATTGGT GCATTATTAC GTGGTGTTTC ACGTGATGAC 550 

GTACAACGTG GTCAAGTTTT AGCTGCTCCT GGTACTATTA CACCACATAC 6 00 

AAAATTCAAA GCGGATGTTT ACGTTTTATC TAAAGATGAA GGTGGTCGTC 650 

ATACACCATT CTTCACTAAC TACCGCCCAC AATTCTATTT 690 



2 ) INFORMATION FOR SEQ ID NO: 199 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 723 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: CSG 83 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199 

GCATTAGTTG TATTCTTAAA CAAAGTTGAC ATGGTTGACG ATGAAGAATT 50 

ATTAGAATTA GTAGAAATGG AAGTTCGTGA TTTATTAAGC GAATATGACT 100 

10 TCCCAGGTGA CGATGTACCT GTAATCTCTG GTTCTGCATT AAAAGCTTTA 150 

GAAGGCGACG CTGACTATGA GCAAAAAATC TTAGACTTAA TGCAAGCTGT 200 

TGATGACTTC ATTCCAACAC CAGAACGTGA TTCTGACAAA CCATTCATGA 250 

TGCCAGTTGA GGACGTATTC TCAATCACTG GTCGTGGTAC TGTTGCTACA 300 

GGCCGTGTTG AACGTGGTCA AATCAAAGTC GGTGAAGAAA TCGAAATCAT 350 

15 CGGTATGCAA GAAGAATCAA GCAAAACAAC TGTTACTGGT GTAGAAATGT 400 

TCCGTAAATT ATTAGACTAC GCTGAAGCTG GTGACAACAT TGGTGCATTA 450 

TTACGTGGTG TTTCACGTGA TGACGTACAA CGTGGTCAAG TTTTAGCTGC 500 

TCCTGGTACT ATTACACCAC ATACAAAATT CAAAGCGGAT GTTTACGTTT 550 

TATCTAAAGA TGAAGGTGGT CGTCATACAC CATTCTTCAC TAACTAC CGC 600 

20 CCACAATTCT ATTTCCGTAC TACTGACGTA ACTGGTGTTG TTAACTTACC 650 

AGAAGGTACT GAAATGGTTA TGCCTGGCGA TAACGTTGAA ATGGATGTTG 700 

AATTAATTTC TCCAATCGCT ATT 723 



25 



35 



2) INFORMATION FOR SEQ ID NO: 200 



( i ) SEQUENCE CHARACTERI STICS : 
(A) LENGTH: 697 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) ' STRAIN: CSsa 18 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200 

CGTTGGTGTT CCAGCATTAG TTGTATTCTT AAACAAAGTT GACATGGTTG 50 

ACGATGAAGA ATTATTAGAA TTAGTAGAAA TGGAAGTTCG TGACTTATTA 100 

AGCGAATATG ACTTCCCAGG TGACGATGTA CCTGTAATCT CTGGTTCTGC 150 

45 ATTAAAAGCT TTAGAAGGCG ACGCTGACTA TGAGCAAAAA ATCTTAGACT 2 00 

TAATGCAAGC TGTTGATGAC TTCATTCCAA CACCAGAACG TGATTCTGAC 250 

AAACCATTCA TGATGCCAGT TGAGGACGTA TTCTCAATCA CTGGTCGTGG 3 00 

TACTGTTGCT ACAGGCCGTG TTGAACGTGG TCAAATCAAA GTCGGTGAAG 350 

AAATCGAAAT CATCGGTATG CAAGAAGAAT CAAGCAAAAC AACTGTTACT 400 

50 GGTGTAGAAA TGTTCCGTAA ATTATTAGAC TACGCTGAAG CTGGTGACAA 450 

CATTGGTGCA TTATTACGTG GTGTTTCACG TGATGACGTA CAACGTGGTC 500 

AAGTTTTAGC TGCTCCTGGT ACTATTACAC CACATACAAA ATTCAAAGCG 550 

GATGTTTACG TTTTATCTAA AGATGAAGGT GGTCGTCATA CACCATTCTT 600 

CACTAACTAC CGCCCACAAT TCTATTTCCG TACTACTGAC GTAACTGGTG 650 

55 TTGTTAACTT ACCAGAAGGT ACTGAAATGG TTATGCCTGG CGATAAC 697 



60 



2 ) INFORMATION FOR SEQ ID NO: 201 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Staphylococcus sciuri subsp. sciuri 

(B) STRAIN: ATCC 29060 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201 

15 CGGCGGTATC TTAGTAGTAT CTGCTGCTGA CGGTCCAATG CCTCAAACTC 50 

GTGAGCACAT TCTTTTATCA CGTAACGTAG GTGTTCCTGC ATTAGTAGTA 100 

TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTATCTGA ATATGACTTC CCAGGCGACG 200 

ACGTTCCTGT AATTGCTGGT TCAGCATTAA AAGCATTAGA AGGCGACGAA 250 

20 GCTTACGAAG ACAAAATCAT GGAATTAATG GATGCTGTTG ATACATTCAT 300 

CCCAACTCCA GAACGTGACT CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400 

CGTGGTCAAA TCACTGTTGG TGAAGAAGTT GAAATCATCG GTTTAACTGA 450 

AGAATCTTCT AAAACAACTG TAACTGGTGT TGAAATGTTC CGTAAATTAT 500 

25 TAGACTTCGC TGAAGCTGGA GATAACATCG GTGCATTATT ACGTGGTGTT 550 

GCTCGTGAAG ACGTTAACCG TGGTCAAGTA TTAGCTAAAC CAGGTTCAAT 600 

CACACCTCAC ACTAAATTCA AAGCTGAAGT TTATGTATTA TCTAAAGACG 650 

AAGGTGGACG TCATACTCCA TTCTTCACAA ACTACCGCCC ACAATTCTAT 700 

TTCCGTACTA CTGACGTAAC TGGTGTAGTT AAC TTACCAG AAGGTACTGA 750 

30 AATGGTTATG CCTGGCGACA ACGTTGAAAT GGACGTTGAA TTAATTTCAC 800 

CAATCGCTAT , TGAAGACGGT ATCGTTTCTC AATCA 835 



35 2 ) INFORMATION FOR SEQ ID NO: 202 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 

(B) STRAIN: ATCC 27836 



50 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202 



CGGCGGTATC TTAGTTGTAT CTGCAGCTGA TGGTCCAATG CCACAAACTC 50 

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC TTTAGTTGTA 100 

TTCTTAAACA AAGTTGATAT GGTAGACGAC GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGACT TATTATCTGA ATATGACTTC CCAGGTGACG 200 

55 ACGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGACGAA 250 

AAATACGAAG AAAAAATCTT AGAATTAATG CAAGCAGTTG ATGACTACAT 300 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTTTACATGA 450 

60 CACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAGTTATTAG 500 
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ACTACGCTGA AGCTGGTGAC 
CGTGAAGACG TACAACGTGG 
ACCACATACA AAATTCAAAG 
GTGGACGTCA CACTCCATTC 
CGTACTACTG ACGTAACTGG 
GGTTATGCCT GGTGATAACG 
TCGCGATTGA AGACGGTACT 



AACATCGGTG CTTTATTACG 
TCAAGTATTA . GCTGCTCCTG 
CGGAAGTTTA CGTTTTATCT 
TTCAGTAACT ACCGCCCACA 
CGTTGTTCAA TTACCAGAAG 
TTGAAATGAC AGTAGAATTA 
CGTTTCTCAA C 



PCT/CAOO/01150 

TGGTGTTGCT" 3% 6 

GTTCAATTAC 600 

AAAGACGAAG 650 

ATTCTATTTC 700 

GTACTGAAAT 750 

ATCGCTCCTA 800 

831 



2) INFORMATION FOR SEQ ID NO: 203 

(i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 

(B) STRAIN: CSG 50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203 

CGGCGGTATC TTAGTTGTAT CTGCAGCTGA TGGTCCAATG CCACAAACTC 50 

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC TTTAGTTGTA 100 

TTCTTAAACA AAGTTGATAT GGTAGACGAC GAAGAAT TAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGACT TATTATCTGA ATATGACTTC CCAGGTGACG 200 

ACGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGACGAA 250 

AAATACGAAG AAAAAATCTT AGAATTAATG CAAGCAGTTG ATGACTACAT 3 00 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTTTACATGA 450 

CACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAGTTATTAG 500 

ACTACGCTGA AGCTGGTGAC AACATCGGTG CTTTATTACG TGGTGTTGCT 550 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCAATTAC 600 

ACCACATACA AAATTCAAAG CGGAAGTTTA CGTTTTATCT AAAGACGAAG 650 

GTGGACGTCA CACTCCATTC TTCAGTAACT ACCGCCCACA ATTCTATTTC 700 

CGTACTACTG ACGTAACTGG CGTTGTTCAA TTACCAGAAG GTACTGAAAT 750 

GGTTATGCCT GGTGATAACG TTGAAATGAC AGTAGAATTA ATCGCTCCTA 800 

TCGCGATTGA AGACGGTACT CGTTTCTCA 829 



2) INFORMATION FOR SEQ ID NO: 204 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bifidobacterium longum 

(B) STRAIN: ATCC 15707 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204 



TGGCGCTATC 
GCGAGCACGT 
GCCCTGAACA 
CGAAGAAGAG 
GCCCGGTCAT 
GACCACGAGA 
CGACTACATC 
CGATCGAGGA 
CGTGTCGAGC 
TATCCGTCCG 
AGACCATGGA 
GGTCTCGGCC 
CTCCGTCACC 
AGGACGAAGG 
TTCTACTTCC 
CGTCGAGATG 
TTCAGCCCAT 



CTCGTTGTGG 
GCTGCTCGCC 
AGTGCGACAT 
GTCCGCGACC 
CCACACCTCC 
AGTGGGTCCA 
CCGACCCCGG 
CGTCTTCACC 
GTGGCCAGCT 
ACCCAGCAGA 
CGCCTGCGAG 
GTGACGATGT 
CCGCACACCA 
CGGCCGTCAC 
GCACCACCGA 
GTTCAGCCGG 
CGCTATGGAG 



CCGCCACCGA 
CGTCAGGTTG 
GGTCGACGAT 
TCCTCGACGA 
GCTTACGGTG 
GTCCGTTAAG 
TTCACGACCT 
ATCTCCGGCC 
GGCCGTCAAC 
CCACCGTCAC 
GCTGGCGACA 
CGAGCGTGGC 
AGTTCGAGGG 
TCGCCGTTCT 
CGTCACCGGC 
GCGACCACGC 
GAAGGCCTGA 



CGGCCCGATG GCCCAGACTC 
GCGTTCCGAA GATCCTCGTC 
GAAGAGCTCA TCGAGCTCGT 
GAACGGCTTC GACCGTGACT 
CTCTGCACGA CGACGCTCCG 
GACCTCATGG ACGCTGTCGA 
GGACAAGCCG TTCCTGATGC 
GTGGTACCGT TGTCACCGGT 
ACCCCGGTCG AGATCGTTGG 
CTCCATCGAG ACCTTCCACA 
ACACCGGTCT GCTTCTGCGT 
CAGGTTGTGG CCAAGCCGGG 
CGAAGTCTAC GTGCTGACCA 
TCTCCAACTA CCGTCCGCAG 
GTCATCGAGC TGCCGGAAGG 
TACCTTCACC GTTGAGCTGA 
CCTTCGCTG 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
839 



2) INFORMATION FOR SEQ ID NO: 205 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 754 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Stenotrophomonas maltophilia 

(B) STRAIN: CDC F3338 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205 

CGGCGCGATC CTGGTGTGCT CGGCCGCTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGTCG CGCCAGGTCG GCGTGCCGTA CATCGTCGTG 100 

TTCCTGAACA AGGCCGACAT GGTCGACGAC GCCGAGCTGC TCGAGCTGGT 150 

CGAGATGGAA GTGCGCGAAC TGCTGAGCAA GTACGAGTTC CCGGGCGACG 2 00 

ACACCCCGAT CATCGCCGGT TCGGCCCGCC TGGCGCTGGA AGGCGACCAG 250 

AGCGACATCG GCGTGCCGGC CATCCTGAAG CTGGTCGACG CGCTGGACAG 300 

CTGGATTCCG GAGCCGGAGC GTGCGATCGA CAAGCCGTTC CTGATGCCGG 350 

TGGAAGACGT GTTCTCGATC TCGGGCCGCG GCACCGTGGT GACCGGTCGT 4 00 

ATCGAGCGCG GCGTGATCAA GGTTGGCGAC GAAATCGAAA TCGTCGGCAT 450 

CCGTCCGGTG CAGAAGACCA CCGTGACCGG CGTTGAAATG TTCCGCAAGC 500 

TGCTGGACCA GGGTCAGGCA GGCGACAACG CTGGCCTGCT GCTGCGCGGC 550 

ACCAAGCGTG ATGACGTCGA GCGTGGCCAG GTGCTGGCCA AGCCGGGCAC 600 

GATCAAGCCG CACACCAAGT TCGAAGGCGA AGTGTACGTC CTGTCGAAGG 650 

ACGAGGGCGG CCGCCACACC CCGTTCTTCA ACGGCTACCG TCCGCAGTTC 700 

TACTTCCGCA CCACCGACAT CACCGGCGCC GCTGCACTGC CGGAAGGCGT 750 

CGAA 754 



2) INFORMATION FOR SEQ ID NO: 206 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus acidominimus 
L0 (B) STRAIN: ATCC 51726 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206 

TGGTGCTATC CTTGTAGTAG CTTCAACTGA CGGACCAATG CCACAAACTC 50 

L5 GTGAGCACAT CCTTCTTTCA CGTCAAGTTG GTGTTAAAAA CCTTATCGTT 100 

TTCATGAACA AAGTTGACCT TGTTGATGAT GAAGAATTGC TTGAATTGGT 150 

TGAAATGGAA ATCCGTGACC TTCTTTCAGA ATACGATTTC CCAGGTGATG 2 00 

ATCTTCCAGT TGTTCAAGGT TCAGCTCTTA AAGCGCTTGA AGGTGATTCA 250 

GCACAAGAAG ATGTTATCAT GGAATTGATG TCAATCGTTG ACACATACAT 300 

20 TCCAGAACCA GAACGTGATA CTGACAAACC ATTGCTTCTT CCAGTCGAGG 350 

ATGTATTCTC AATCACTGGA CGTGGTACTG TTGCTTCAGG ACGTATCGAC 400 

CGTGGTACTG TTAAAGTTAA TGACGAAGTT GAAATCGTTG GTATCAAAGA 450 

CGAAATCTCT AAAGCAGTTG TTACTGGTGT TGAAATGTTC CGTAAACAAC 500 

TTGACGAAGG TCTTGCTGGA GATAACGTTG GTGTTCTTCT TCGTGGTGTA 550 

25 CAACGTGATG AAATCGAACG TGGTCAAGTT CTTGCTAAAC CAGGTTCAAT 600 

CAACCCACAC ACTAAATTCA AAGGTGAAGT TTACGTTCTT TCTAAAGAAG 650 

AAGGTGGACG TCACACTCCA TTCTTCGATA ACTACCGTCC TCAGTTCTAC 700 

TTCCGTACAA CTGACGTAAC TGGTTCAATC AAATTGCCAG AAGGTACTGA 750 

AATGGTAATG CCTGGTGATA ACGTAACTAT CGAAGTTGAG TTGATCCACC 800 

30 CAATCGCCGT TGAACAAGGT ACTACTTTCT CTATC 835 



2) INFORMATION FOR SEQ ID, NO: 207 



35 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 12403 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207 

50 CTATCCTTGT AGTTGCTTCA ACTGATGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA AGTTGGTGTT AAACACCTTA TCGTATTCAT 100 

GAACAAAGTT GACCTTGTTG ATGATGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATTCG TGACCTTCTT TCAGAATACG ACTTCCCAGG TGATGACCTT 200 

CCAGTTATCC AAGGTTCAGC TCTTAAAGCA CTTGAAGGCG ACGAAAAATA 250 

55 CGAAGACATC ATCATGGAAT TGATGAGCAC TGTTGATGAG TACATTCCAG 300 

AACCAGAACG TGATACTGAC AAACCTTTAC TTCTTCCAGT TGAAGATGTA 350 

TTCTCAATCA CTGGACGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 400 

TACTGTTCGT GTCAACGACG AAGTTGAAAT CGTTGGTATT AAAGAAGATA 450 

TCCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAACTTGAC 500 

50 GAAGGTCTTG CAGGGGACAA CGTTGGTGTT CTTCTTCGTG GTGTTCAACG 550 
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TGATGAAATC GAACGTGGTC AAGTTCTTGC TAAACCAGGT ^{fASWMSf?^ 
CACACACTAA ATTTAAAGGT GAAGTTTACA TCCTTTCTAA AGAAGAAGGT 
GGACGTCATA CTCCATTCTT CAACAACTAC CGTCCACAAT TCTACTTCCG 
TACAACTGAC GTAACAGGTT CAATCGAACT TCCAGCAGGA ACAGAAATGG 
TTATGCCTGG TGATAACGTT ACTATCGAAG TTGAATTGAT TCACCCAATC 
GCCGTAGAAC AAGGTACTA 



Scfo 



650 
700 
750 
800 
819 



LO 



L5 



20 



25 



30 



S.5 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 208 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

( i i ) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 12973 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208 

CTATCCTTGT AGTTGCTTCA ACTGATGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA AGTTGGTGTT AAACACCTTA TCGTATTCAT 100 

GAACAAAGTT GACCTTGTTG ATGATGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATTCG TGACCTTCTT TCAGAATACG ACTTCCCAGG TGATGACCTT 200 

CCAGTTATCC AAGGTTCAGC TCTTAAAGCA CTTGAAGGCG ATGAAAAATA 250 

CGAAGACATC ATCATGGAAT TGATGAGCAC TGTTGATGAG TACATTCCAG 300 

AACCAGAACG TGATACTGAC AAAC CTTTAC TTCTTCCAGT CGAAGATGTA 350 

TTCTCAATCA CTGGACGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 400 

TACTGTTCGT GTCAACGACG AAGTTGAAAT CGTTGGTATT AAAGAAGATA 450 

TCCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAACTTGAC 500 

GAAGGTCTTG CAGGGGACAA CGTTGGTGTT CTTCTTCGTG GTGTTCAACG 550 

TGATGAAATC GAACGTGGTC AAGTTCTTGC TAAAC CAGGT TCAATCAACC 600 

CACACACTAA ATTTAAAGGT GAAGTTTACA TCCTTTCTAA AGAAGAAGGT 650 

GGACGTCATA CTCCATTCTT CAACAACTAC CGTCCACAAT TCTACTTCCG 700 

TACAACTGAC GTAACAGGTT CAATCGAACT TCCAGCAGGA ACAGAAATGG 750 

TTATGCCTGG TGATAACGTT ACTATCGAAG TTGAATTGAT TCACCCAATC 800 

GCCGTAGAAC AAGGTACTA 819 



2) INFORMATION FOR SEQ ID NO : 209 

( i ) S EQUENCE CHARACTER I S T I CS : 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 13813 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209 
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AGCTATCCTT GTAGTTGCTT CAACTGATGG ACCAATGCCA CAAACTCGTG 50 

AGCACATCCT TCTTTCACGT CAAGTTGGTG TTAAACACCT TATCGTATTC 100 

ATGAACAAAG TTGACCTTGT TGATGATGAA GAATTGCTTG AATTGGTTGA 150 

5 AATGGAAATT CGTGACCTTC TTTCAGAATA CGACTTCCCA GGTGATGACC 200 

TTCCAGTTAT CCAAGGTTCA GCTCTTAAAG CACTTGAAGG CGATGAAAAA ,250 

TACGAAGACA TCATCATGGA ATTGATGAGC ACTGTTGATG AGTACATTCC 300 

AGAACCAGAA CGTGATACTG ACAAACCTTT ACTTCTTCCA GTCGAAGATG 350 

TATTCTCAAT CACTGGACGT GGTACAGTTG CTTCAGGACG TATCGACCGT 400 

LO GGTACTGTTC GTGTCAACGA CGAAGTTGAA ATCGTTGGTA TTAAAGAAGA 450 

TATCCAAAAA GCAGTTGTTA CTGGTGTTGA AATGTTCCGT AAACAACTTG 500 

ACGAAGGTCT TGCAGGGGAC AACGTTGGTG TTCTTCTTCG TGGTGTTCAA 550 

CGTGATGAAA TCGAACGTGG TCAAGTTCTT GCTAAACCAG GTTCAATCAA 600 

CCCACACACT AAATTTAAAG GTGAAGTTTA CATCCTTTCT AAAGAAGAAG 650 

L5 GTGGACGTCA TACTCCATTC TTCAACAACT ACCGTCCACA AT"f CTACTTC 700 

CGTACAACTG ACGTAACAGG TTCAATCGAA CTTCCAGCAG GAACAGAAAT 750 

GGTTATGCCT GGTGATAACG TTACTATCGA AGTTGAATTG ATTCACCCAA 800 

TCGCCGTAGA ACAAGGTACT AC 822 

10 

2) INFORMATION FOR SEQ ID NO: 210 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: CDCss-1073 



B5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210 



CGGAGCTATC CTTGTAGTTG CTTCAACTGA TGGACCAATG CCACAAACTC 50 

GTGAGCACAT CCTTCTTTCA CGTCAAGTTG GTGTTAAACA CCTTATCGTA 100 

40 TTCATGAACA AAGTTGACCT TGTTGATGAT GAAGAATTGC TTGAATTGGT 150 

TGAAATGGAA ATTCGTGACC TTCTTTCAGA ATACGACTTC CCAGGTGATG 200 

ACCTTCCAGT TATCCAAGGT TCAGCTCTTA AAGCACTTGA AGGCGACGAA 250 

AAATACGAAG ACATCATCAT GGAATTGATG AGCACTGTTG ATGAGTACAT 3 00 

TCCAGAACCA GAACGTGATA CTGACAAACC TTTACTTCTT CCAGTTGAAG 3 50 

45 ATGTATTCTC AATCACTGGA CGTGGTACAG TTGCTTCAGG ACGTATCGAC 400 

CGTGGTACTG TTCGTGTCAA CGACGAAGTT GAAATCGTTG GTATTAAAGA 450 

AGATATCCAA AAAGCAGTTG TTACTGGTGT TGAAATGTTC CGTAAACAAC 500 

TTGACGAAGG TCTTGCAGGG GACAACGTTG GTGTTCTTCT TCGTGGTGTT 550 

CAACGTGATG AAATCGAACG TGGTCAAGTT CTTGCTAAAC CAGGTTCAAT 600 

50 CAACCCACAC ACTAAATTTA AAGGTGAAGT TTACATCCTT TCTAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCAACA ACTACCGTCC ACAATTCTAC 700 

TTCCGTACAA CTGACGTAAC AGGTTCAATC GAACTTCCAG CAGGAACAGA 750 

AATGGTTATG CCTGGTGATA ACGTTACTAT CGAAGTTGAA TTGATTCACC 800 

CAATCGCCGT AGAACAAGGT ACTAC 825 

55 



2) INFORMATION FOR SEQ ID NO: 211 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus anginosus 

(B) STRAIN: ATCC 33397 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211 



GGAGCTATCC TTGTAGTAGC TTCAACTGAC 
TGAACACATC CTTCTTTCAC GCCAAGTAGG 
TCATGAATAA AGTTGACTTG GTTGACGATG 
GAAATGGAAA TCCGTGACCT TCTTTCAGAA 
AATCCCAGTT ATCCAAGGTT CAGCTCTTAA 
AATATGAAGA CATCATCATG GAATTGATGG 
CCAGAACCAG AACGTGACAC TGACAAACCA 
TGTATTCTCA ATTACTGGAC GTGGTACTGT 
GTGGTACTGT TAAAGTCAAC GACGAAGTTG 
GAAATCCAAA AAGCAGTTGT TACTGGTGTT 
GGACGAAGGT CTTGCTGGAG ATAACGTAGG 
AACGTGACGA AATCGAACGT GGACAAGTTC 
CATCCACACA CTAAATTCAA AGGTGAAGTT 
AGGTGGACGT CATACTCCAT TCTTCAACAA 
TCCGTACTAC AGACGTTACA GGTTCAATCG 
ATGGTAATGC CTGGTGATAA CGTAACAATC 
AATTGCCGTA GAACAAGGAA CTACAT 



GGACCAATGC CTCAAACTCG 50 

TGTTAAATAC CTTATCGTCT 100 

AAGAATTGCT TGAATTGGTT 150 

TACGATTTCC CAGGTGATGA 200 

AGCTCTTGAA GGTGATGAAA 250 

ATACTGTTGA TGAATACATT 300 

TTGCTTCTTC CAGTTGAAGA 350 

TGCTTCAGGA CGTATCGACC 400 

AAATCGTTGG TATCCGTGAT 450 

GAAATGTTCC GTAAACAATT 500 

GGTTCTTCTT CGTGGTATCC 550 

TTGCTAAACC AGGTTCAATT 600 

TACATCCTTA CTAAAGAAGA 650 

CTACCGTCCT CAATTCTACT 700 

AACTTCCTGC AGGTACTGAA 750 

GACGTTGAAT TGATCCACCC 800 

* 826 



2) INFORMATION FOR SEQ ID NO: 212 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus bovis 

(B) STRAIN: ATCC 33317 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212 



TGGTGCTATC CTTGTAGTAG 
GTGAACACAT CCTTCTTTCA 
TTCATGAACA AAGTTGACCT 
TGAAATGGAA ATCCGTGACC 
AAATCCCTGT AATCCAAGGT 
CACTACGAAG ACATCATCAT 
TCCAGAACCA AAACGTGATA 
ACGTATTCTC AATCACTGGT 
CGTGGTACTG TTAAAGTCAA 
CGACATCCAA AAAGCTGTTG 
TTGATGAAGG TATCGCAGGG 



CTTCTACAGA TGGTCCAATG 
CGTCAAGTTG GTGTTAAACA 
TGTTGATGAC GAAGAATTGC 
TTCTTTCAGA ATATGATTTC 
TCAGCTCTTA AAGCCCTTGA 
GGAATTGATG AACACTGTAG 
CTGACAAACC ATTGCTTCTT 
CGTGGTACTG TAGCATCAGG 
CGACGAAGTT GAAATCGTTG 
TTACTGGTGT TGAAATGTTC 
GATAACGTTG GTGTTCTTCT 
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CCACAAACAC 5 0 

CCTTATCGTC 100 

TTGAATTGGT 150 

CCAGGTGATG 200 

AGGTGACACT 250 

ATGAATACAT 300 

CCAGTCGAAG 350 

ACGTATCGAC 4 00 

GTATCCGTGA 4 50 

CGTAAACAAC 500 

TCGTGGTATC 550 
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CAACGTGATG AAATCGAACG TGGTCAAGTT CTTGCTAAAC CAGGTTCAAT 
CCACCCACAC ACTAAATTCA AAGGTGAAGT TTACATCCTT AC T AAAGAAG 
AAGGTGGACG TCACACTCCA TTCTTCAACA ACTACCGTCC TCAATTCTAC 
TTCCGTACAA CTGACGTTAC AGGTTCAATC GAACTTCCAG CAGGTACTGA 
AATGGTAATG CCTGGTGATA ACGTTACTAT CGACGTTGAA TTGATTCACC 
CAATCGCCGT TGAACAAGGT ACTACAT 



600 
650 
700 
750 
800 
827 
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2 ) INFORMATION FOR SEQ ID NO: 213 

{ i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) ST RANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus anginosus (deposited as 
Streptococcus constellatus) 

(B) STRAIN: ATCC 2 7823 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213 

GCTATCCTCG TAGTAGCTTC AACTGATGGA CCAATGCCTC AAACTCGTGA 50 

ACATATCCTT CTTTCACGTC AAGTAGGTGT TAAATACCTT ATCGTCTTCA 100 

TGAACAAAGT TGACTTGGTT GACGATGAAG AATTGCTTGA ATTGGTTGAA 150 

ATGGAAATCC GTGACCTTCT TTCAGAATAC GATTTCCCAG GTGATGAAAT 2 00 

CCCAGTTATC CAAGGTTCAG CTCTTAAAGC TCTTGAAGGT GATGAAAAAT 2 50 

ATGAAGACAT CATCATGGAA TTGATGGATA CTGTTGATGA ATACATTCCA 300 

GAACCAGAAC GTGACACTGA CAAACCACTT CTTCTTCCAG TCGAAGATGT 3 50 

ATTCTCAATC ACTGGACGTG GTACTGTTGC TTCAGGACGT ATCGACCGTG 4 00 

GTACTGTTAA AGTCAATGAT GAAGTTGAAA TTGTTGGTAT TCGTGACGAA 4 50 

ATCCAAAAAG CAGTTGTTAC TGGTGTTGAA ATGTTCCGTA AACAATTGGA 500 

CGAAGGTCTT GCTGGAGATA ACGTAGGGGT TCTTCTTCGT GGTATCCAAC 550 

GTGACGAAAT CGAACGTGGA CAAGTTCTTG CTAAAC CAGG TTCAATTCAT 600 

CCACACACTA AATTCAAAGG TGAAGTTTAC ATCC TT ACTA AAGAAGAAGG 650 

TGGACGTCAT ACTCCATTCT TCAACAACTA CCGTCCTCAA TTCTACTTCC 7 00 

GTACTACAGA CGTTACAGGT TCAATCGAAC TTCCTGCAGG TACTGAAATG 750 

GTAATGCCTG GTGATAACGT AACAATTGAT GTTGAGTTGA TCCACCCAAT 800 

TGCCGTAGAA CAAGGAACTA C 821 



2 ) INFORMATION FOR SEQ ID NO: 214 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus cricetus 

(B) STRAIN: ATCC 19642 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214 

GCTATCCTTG TAGTAGCTTC TACAGACGGA CCAATGCCAC AAACTCGTGA 50 

ACACATCTTG CTTTCACGCC AAGTTGGTGT TAAGAGCCTT ATCGTCTTCA 100 

5 TGAACAAGGT TGACTTGGTT GACGATGAAG AATTGCTTGA ATTGGTTGAA 150 

ATGGAAATCC GTGATCTTCT TTCAGAATAC GATTTCCCAG GTGATGATAT 200 

CCCTGTTGTT CAAGGTTCAG CTCTTAAAGC CCTTGAAGGT GATACAGCTG 250 

CCGAAGACAA GATCATGGAA TTGATGGACA TCGTTGATGA CTACATTCCA 300 

GAACCAAAAC GTGATACTGA TAAGCCATTG CTTCTTCCAG TCGAAGACGT 350 

10 ATTCTCAATC ACTGGACGTG GTACTGTTGC TTCAGGACGT ATCGACCGTG 400 

GTACTGTTAA GGTCAATGAC GAAGTTGAAA TCGTTGGTAT CAAGGACGAA 450 

ATCCAAAAAG CGGTTGTTAC CGGAGTTGAA ATGTTCCGTA AACAATTGGA 500 

TGAAGGTCTT GCAGGGGATA ACGTTGGTGT GCTTCTTCGT GGTATCCAAC 550 

GTGATGAAAT CGAACGTGGT CAAGTATTGG CTGCACCTGG TTCAATCCAT 600 

15 CCACACACTA AATTCAAGGG TGAAGTTTAC ATCCTTTCTA AAGATGAAGG 650 

TGGACGTCAC ACTCCATTCT TCAACAACTA CCGTCCACAG TTCTACTTCC 700 

GTACAACTGA CGTAACTGGT TCAATCGAAT TGCCAGCAGG TACTGAAATG 750 

GTTATGCCTG GTGATAACGT TACTATCGAC GTTGAATTGA TCCACCCAAT 800 

CGCTGTTGAA AAAGGTACTA C 821 

20 



2) INFORMATION FOR SEQ ID NO: 215 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus cristatus 
35 (B) STRAIN: ATCC 51100 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215 

TATCCTTGTA GTAGCTTCAA CTGACGGACC AATGCCACAA ACTCGTGAGC 50 

40 ACATCCTTCT TTCACGTCAG GTTGGTGTTA AACACCTTAT CGTCTTCATG 100 

AACAAGATCG ACTTGGTTGA TGACGAAGAA TTGCTTGAAT TGGTTGAAAT 150 

GGAAATCCGT GACCTCTTGT CAGAATACGA CTTCCCAGGT GACGATCTTC 2 00 

CAGTTATCCA AGGTTCAGCT CTTAAAGCTC TTGAAGGTGA TACTAAGTAC 250 

GAAGACATCA TCATGGAATT GATGAACACT GTTGATGAGT ACATCCCAGA 300 

45 ACCAGAACGT GATACTGACA AACCTCTTCT TCTTCCAGTC GAAGACGTAT 350 

TCTCAATCAC TGGTCGTGGT ACAGTTGCTT CAGGACGTAT CGACCGTGGT 400 

ACTGTTCGTG TCAACGATGA AATCGAAATC GTTGGTATCA AAGAAGAAAT 450 

CCAAAAAGCA GTTGTTACTG GTGTTGAAAT GTTCCGTAAA CAGCTTGACG 500 

AAGGTCTTGC AGGGGACAAC GTAGGTGTAC TTCTTCGTGG TATCCAACGT 550 

50 GATGAAATCG AACGTGGTCA AGTTATCGCT AAACCAGGTT CAATCAACCC 600 

ACACACTAAA TTCAAGGGTG AAGTTTACAT CCTTACTAAA GAAGAAGGTG 650 

GACGTCACAC TCCATTCTTC AACAACTACC GTCCACAGTT CTACTTCCGT 700 

ACAACTGACG TTACAGGTTC AATCGAACTT CCAGCAGGTA CTGAAATGGT 750 

AATGCCTGGT GATAACGTAA CTATCGACGT TGAGTTGATC CACCCAATCG 800 

55 CCGTTGAACA AGGTACTCCT T 821 



60 



2) INFORMATION FOR SEQ ID NO: 216 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 792 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Streptococcus downei 

(B) STRAIN: ATCC 33748 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216 

15 AGTAGCTTCT ACTGATGGAC CAATGCCACA AACTCGTGAA CACATCTTGC 50 

TTTCACGTCA GGTTGGTGTT AAGAACCTTA TCGTCTTCAT GAACAAGGTT 100 

GACTTGGTTG ACGATGAAGA ATTGCTTGAA TTGGTTGAAA TGGAAATCCG 150 

TGACCTGCTT TCAGAATACG ATTTCCCAGG TGATGATATC CCTGTTGTTC 200 

AAGGTTCAGC TCTTAAGGCT CTTGAAGGTG ATACAGCTGC CGAAGACAAG 250 

2 0 ATCATGGAAT TGATGGACAT CGTTGATGAC TACATTCCAG AACCAAAACG 300 

TGATACTGAT AAGCCTTTGC TTCTTCCAGT CGAAGATGTA TTCTCAATCA 350 

CTGGACGTGG TACTGTAGCT TCAGGACGTA TCGACCGTGG TACTGTTAAG 400 

GTCAACGACG AAGTTGAAAT CGTTGGTATC AAGGACGAAA TCCAAAAAGC 450 

AGTTGTTACC GGAGTTGAAA TGTTCCGTAA ACAATTGGAC GAAGGTCTTG 500 

25 CAGGGGATAA CGTTGGTGTG CTTCTTCGTG GTATCCAACG TGATGAAATC 550 

GAACGTGGTC AAGTGTTGGC TGCGCCTGGT TCGATTCACC CACACACTAA 600 

GTTTAAAGGT GAAGTTTACA TCCTTTCTAA AGAAGAAGGT GGACGTCATA 650 

CTCCATTCTT TAACAACTAC CGTCCACAGT TCTACTTCCG TACAACTGAC 700 

GTAACTGGTT CAATCGAATT GCCAGCGGGT ACTGAAATGG TTATGCCTGG 750 

3 0 TGATAACGTT ACTATCGACG TTGAATTGAT CCACCCAATT GC 792 



2) INFORMATION FOR SEQ ID NO: 217 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 795 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Streptococcus dysgalactiae 

(B) STRAIN: ATCC 43078 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217 

50 GTAGTTGCTT CAACAGACGG ACCAATGCCA CAAACTCGTG AGCACATCCT 50 

CCTTTCACGT CAGGTTGGTG TTAAACACCT TATCGTGTTC ATGAACAAAA 100 

TTGACCTTGT TGACGATGAA GAATTGCTTG AATTGGTTGA AATGGAAATC 150 

CGTGACCTTC TTTCAGAATA CGATTTCCCA GGTGATGACC TTCCAGTTAT 2 00 

CCAAGGTTCA GCTCTTAAAG CTCTTGAAGG CGACACTAAA TTTGAAGACA 250 

55 TCATCATGGA ATTGATGGAT ACTGTTGATT CATACATTCC AGAACCAGAA 300 

CGTGACACTG ACAAACCATT GCTTCTTCCA GTCGAAGACG TATTCTCAAT 350 

CACAGGTCGT GGTACAGTTG CTTCAGGACG TATCGACCGT GGTACTGTTC 400 

GTGTCAACGA CGAAATCGAA ATCGTTGGTA TCAAAGAAGA AACTAAAAAA 450 

GCTGTTGTTA CTGGTGTTGA AATGTTCCGT AAACAACTTG ACGAAGGTCT 500 

50 TGCAGGAGAC AACGTAGGTA TCCTTCTTCG TGGTGTTCAA CGTGACGAAA 550 

130 
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TCGAACGTGG TCAAGTTATT GCTAAACCAG GTTCAATCAA CCCACACACT 
AAATTCAAAG GTGAAGTATA TATCCTTTCT AAAGACGAAG GTGGACGTCA 
CACTCCATTC TTCAACAACT ATCGTCCACA ATTCTACTTC CGTACAACTG 
ACGTAACAGG TTCAATCGAA CTTCCAGCTG GTACAGAAAT GGTTATGCCT 
GGTGATAACG TGACAATCAA CGTTGAGTTG ATCCACCCAA TCGCC 
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2) INFORMATION FOR SEQ ID NO: 218 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic apid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

( i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus equi subsp. equi 

(B) STRAIN: ATCC 9528 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218 

CGGAGCTATC CTTGTAGTTG CTTCAACTGA CGGACCAATG CCACAAACTC 50 

GTGAGCACAT CCTTCTTTCA CGTCAGGTTG GTGTTAAGCA CCTTATCGTG 100 

TTCATGAACA AGGTTGACCT TGTTGACGAT GAAGAATTGC TTGAGCTTGT 150 

TGAAATGGAA ATTCGTGACC TTCTTTCAGA ATATGATTTC CCAGGTGATG 200 

ACCTTCCAGT TATCCAAGGT TCAGCGCTTA AGGCTCTTGA AGGCGACAGC 250 

AAATACGAAG ATATCATCAT GGAATTGATG GATACTGTTG ATTCATACAT 300 

TCCAGAACCA GAACGTGACA CAGACAAGCC ATTGCTTCTT CCAGTCGAGG 350 

ACGTATTCTC AATCACTGGA CGTGGTACTG TTGCTTCAGG ACGTATCGAC 400 

CGCGGTACTG TTCGTGTTAA CGACGAAATC GAAATCGTTG GTATCAGAGA 450 

CGAGATCAAA AAAGCAGTTG TTACTGGTGT CGAAATGTTC CGTAAACAGC 500 

TTGACGAAGG TCTTGCAGGG GACAACGTTG GTGTTCTTCT TCGTGGTGTA 550 

CAACGTGATG AAATCGAACG TGGTCAAGTT ATTGC TAAGC CAGGTTCTAT 600 

CAACCCACAC ACTAAATTTA AAGGTGAAGT ATATATC CTT ACTAAAGAAG 650 

AAGGTGGACG TCACACACCA TTCTTCAACA ACTATCGTCC ACAATTCTAC 700 

TTCCGTACTA CTGACGTAAC AGGTTCAATC GAGCTTCCAG CAGGTACAGA 750 

AATGGTTATG CCTGGTGATA ACGTGACTAT TGACGTTGAG TTGATCCACC 800 

CAATCGCCGT AGAACAAGGT ACTACATT 82 8 



2) INFORMATION FOR SEQ ID NO: 219 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus ferus 

(B) STRAIN: ATCC 33477 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219 
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CGGTGCAATC CTTGTAGTAG CTTCTACAGA TGGACCAATG CCACAAACTC 50 

GTGAGCACAT CCTTCTTTCA CGTCAGGTAG GTGTTAAACA CCTTATCGTC 100 

TTCATGAACA AAGTTGACTT GGTTGACGAT GAAGAATTGC TTGAATTGGT 150 

TGAAATGGAA ATCCGTGACC TGCTTTCAGA ATATGATTTC CCAGGTGATG 200 

5 ACCTTCCAGT TATCCAAGGT TCAGCTCTTA AAGCGCTTGA AGGTGATACT 250 

GCTCAAGAAG ATGTTATCAT GGAATTGATG AAAACCGTTG ATGAGTACAT 300 

CCCAGAACCA GAACGTGATA CTGACAAACC ATTGCTTCTT CCAGTCGAAG 350 

ATGTATTCTC AATCACAGGT CGTGGTACTG TAGCTTCAGG ACGTATCGAT 400 

CGTGGTACTG TAAGAGTCAA CGATGAAGTT GAAATCGTTG GTATCAAAGA 450 

10 CGAAATCACT AAAGCAGTTG TTACCGGTGT TGAAATGTTC CGTAAACAAT 500 

TGGACGAAGG TCTTGCTGGT GATAACGTTG GTGTGCTTCT CCGTGGTGTG 550 

CAACGTGATG AAATCGAACG TGGTCAAGTA TTGGCTAAAC CAGGTTCAAT 600 

CAACCCACAC ACTAAATTTA AAGGTGAAGT TTACATCCTT ACTAAAGAAG 650 

AAGGTGGACG TCATACACCA TTCTTCAACA ACTACCGTCC ACAGTTCTAC 700 

15 TTCCGTACAA CTGACGTAAC TGGTTCAATC GAATTGCCAG CAGGTACTGA 750 

AATGGTTATG CCTGGTGATA ACGTGACTAT CGACGTTGAA TTGATCCACC 800 

CAATCGCCGT TGAACAAGGT ACTAC 825 
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2) INFORMATION FOR SEQ ID NO: 220 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 82 6 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus gordonii 

(B) STRAIN: ATCC 10.558.:' 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220 

CGGAGCTATC CTTGTAGTAG CTTCAACTGA TGGTCCTATG CCACAAACTC 50 

GTGAGCACAT CCTTCTCTCA CGCCAAGTTG GTGTTAAACA CTTGATCGTG 100 

TTCATGAACA AAGTTGACTT GGTTGACGAT GAAGAATTGC TTGAGTTGGT 150 

10 TGAAATGGAA ATCCGTGACC TCTTGTCAGA ATACGACTTC CCAGGTGACG 200 

ATCTTCCAGT TATCCAAGGT TCAGCTCTTA AAGCTCTTGA AGGTGACTCT 250 

AAATATGAAG ATATCATCAT GGAATTGATG AACACTGTTG ATGAGTACAT 300 

CCCAGAACCA GAACGCGACA CTGACAAACC ATTGCTTCTT CCAGTCGAAG 350 

ACGTATTCTC AATCACTGGA CGTGGTACAG TTGCTTCAGG ACGTATCGAC 4 00 

15 CGTGGTATCG TTAAAGT CAA TGACGAAATC GAAATCGTTG GTATCAAAGA 450 

AGAAATCCAA AAAGCAGTTG TTACTGGTGT TGAAATGTTC CGTAAACAGC 500 

TTGACGAAGG TCTTGCAGGG GACAACGTTG GTGTGCTTCT TCGTGGTATC 550 

CAACGTGATG AAATCGAACG TGGACAAGTT ATTGCTAAAC CAGGTTCAAT 600 

CAACCCACAC ACTAAATTCA AAGGTGAAGT TTATATCCTT ACTAAAGAAG 650 

50 AAGGTGGACG TCACACTCCA TTCTTCAACA ACTACCGTCC ACAGTTCTAC 700 

TTCCGTACAA CTGACGTTAC AGGTTCAATC GAACTTCCAG CAGGTACTGA 750 

AATGGTAATG CCTGGTGATA ACGTAACTAT CGACGTTGAG TTGATCCACC 800 

CAATCGCCGT TGAACAAGGT ACTACT 826 



2) INFORMATION FOR SEQ ID NO: 221 



(i) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 799 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus anginosus 

(B) STRAIN: ATCC 27335 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221 



TGTAGTAGCT TCAACTGACG GACCAATGCC TCAAACTCGT GAACATATCC 50 

TTCTTTCACG TCAAGTAGGT GTTAAATACC TTATTGTCTT CATGAACAAA 100 

15 GTTGACTTGG TTGACGATGA AGAATTGCTT GAATTGGTTG AAATGGAAAT 150 

CCGTGATCTT CTTTCAGAAT ACGATTTCCC AGGTGATGAT ATTCCAGTAA 2 00 

TCCAAGGTTC AGCACTTAAA GCTCTTGAAG GTGATGAAAA ATATGAAGAC 250 

ATCATCATGG AATTGATGAA TACTGTTGAT GAATATATTC CAGAACCAGA 3 00 

ACGTGATACT GAC AAAC CAT TGCTTCTTCC AGTCGAAGAT GTATTCTCAA 350 

20 TCACTGGACG TGGTACTGTT GCTTCAGGAC GTATCGACCG TGGTACTGTT 4 00 

AAAGTCAACG ATGAAGTTGA AATCGTTGGT ATCCGCGAGG AAATCCAAAA 450 

AGCAGTTGTT ACTGGTGTTG AAATGTTCCG TAAACAATTG GACGAAGGTC 500 

TTGCTGGAGA TAACGTAGGG GTTCTTCTTC GTGGTATCCA ACGTGACGAA 550 

ATTGAACGTG GACAAGTTCT TGCTAAACCA GGTTCAATTC ATCCACACAC 600 

25 TAAATTCAAA GGTGAAGTTT ACATCCTTAC TAAAGAAGAA GGTGGACGTC 650 

ATACTCCATT CTTCAACAAC TACCGTCCTC AATTCTACTT CCGTACTACA 700 

GACGTTACAG GTTCAATCGA ACTTCCTGCA GGTACTGAAA TGGTAATGCC 750 

TGGTGATAAC GTAACAATTG ATGTTGAGTT GATCCACCCA ATTGCCGTA 799 



2 ) INFORMATION FOR SEQ ID NO : 222 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus macacae 

(B) STRAIN: ATCC 35911 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222 



TGGTGCTATT CTTGTAGTAG CTTCAACTGA CGGTCCAATG CCTCAAACGC 50 

GTGAACATAT CCTTCTTTCA CGCCAAGTAG GTGTTAAAAA CCTTATTGTT 100 

50 TTCATGAATA AAGTTGACTT AGTTGATGAT GAAGAATTGC TTGAATTGGT 150 

TGAAATGGAA ATCCGTGATC TTCTTACAGA ATATGATTTC CCAGGCGATG 200 

AACTTCCAGT TATCCAAGGT TCAGCACTTA AAGCTCTTGA AGGTGATACT 250 
AAGTACGAAG ATATTATCAT GGAATTGTTG GATACTGTAG ATGATTACAT - 300 

CCCAGAACCA CAACGTGATA CTGACAAGCC ATTGCTTCTT CCAGTCGAAG 350 

55 ATGTTTTCTC TATTACTGGA CGTGGTACTG TTGCTTCAGG ACGTATTGAC 400 

CGTGGTACTG TTAAGGTTAA TGATGAAGTT GAAATCGTTG GTATTCGTGA 450 

CGATATTCAA AAAGCAGTTG TTACTGGTGT TGAAATGTTC CGTAAACAGC 500 

TTGACGAAGG TCTTGCTGGT GATAACGTCG GTGTCCTTCT TCGTGGTATC 550 

CAACGTGATG AAATTGAACG CGGTCAAGTT CTTGCTAAAC CAGGATCAAT 600 

50 TCATCCACAT ACTAAATTCA AAGGTGAAGT TTATATTCTT ACTAAAGAAG 650 
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AAGGTGGACG TCATACTCCA TTCTTTAACA 
TTCCGTACAA CTGATGTAAC TGGTTCAATT 
AATGGTTATG CCTGGTGATA ATGTTACGAT 
CAATCGCTGT TGAACAAGGT ACAAC 
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ACTACCGTCC ACAGTTCTAC 700 
GATTTGCCAG CAGGTACTGA 750 
TGATGTTGAA CTGATCCACC 800 

825 



2) INFORMATION FOR SEQ ID NO: 223 



10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus gordonii (deposited as 
20 Streptococcus mitis) 

(B) STRAIN: ATCC 33399 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223 

25 CTATCCTTGT AGTAGCTTCA ACTGACGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA GGTTGGTGTT AAACACCTTA TCGTCTTCAT 100 

GAACAAAGTT GACTTGGTTG ACGACGAAGA ATTGCTTGAA TTGGTTGAAA 150 
TGGAAATCCG TGACC TATTG TCAGAATACG ACTTCCCAGG TGACGATCTT . 200 

CCAGTTATCC AAGGTTCAGC TCTTAAAGCC CTTGAAGGTG ACACTAAATA 250 

3 0 CGAAGACATC GTTATGGAAT TGATGAACAC AGTTGATGAG TACATCCCAG 300 

AACCAGAACG TGACACTGAC AAACCATTGC TTCTTCCAGT CGAAGACGTA 350 

TTCTCAATCA CTGGTCGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 400 

TATCGTTAAA GTCAACGACG AAATCGAAAT CGTTGGTATC AAAGAAGAAA 450 

CTCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAACTTGAC 500 

35 GAAGGTCTTG CCGGAGATAA TGTAGGTGTC CTTCTTCGTG GTGTTCAACG 550 

TGATGAAATC GAACGTGGAC AAGTTATTGC TAAACCAGGT TCAATCAACC 600 

CACACACTAA ATTCAAAGGT GAAGTTTACA TCCTTACTAA AGAAGAAGGT 650 

GGACGTCACA CTCCATTCTT CAACAACTAC CGTCCACAAT TCTACTTCCG 700 

TACTACTGAC GTTACAGGTT CAATCGAACT TCCAGCAGGT ACTGAAATGG 750 

40 TAATGCCTGG TGATAACGTG ACAATCGACG TTGAGTTGAT CCACCCAATC 800 

GCCGTAGAAC AAGGTACTAC AT 822 



15 2) INFORMATION FOR SEQ ID NO: 224 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mutans 

(B) STRAIN: ATCC 25175 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224 

50 
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CCCTGGTGCT ATCCTTGTAG TAGCTTCAAC TGATGGACCA ATGCCACAAA 50 

CTCGTGAACA CATTCTTCTT TCACGTCAAG TTGGTGTTAA ATACCTCATT 100 

GTCTTCATGA ATAAAGTTGA TTTGGTTGAC GATGAAGAAT TGCTTGAATT 150 

GGTTGAAATG GAAATCCGTG ATCTTCTTTC AGAATATGAT TTCCCAGGTG 2 00 

5 ATGATATTCC AGTTATTCAA GGTTCAGCTC TTAAAGCTCT TGAAGGCGAT 250 

ACTGCTCAAG AAG AT AT CAT CATGGAATTA ATGCATACTG TTGATGACTA 300 

CATTCCAGAT CCAGAACGTG ATACTGACAA GCCGCTCCTT CTTCCAGTCG 3 50 

AAGATGTTTT CTCAATCACT GGTCGTGGTA CTGTTGCTTC AGGACGTATT 400 

GATCGTGGTA CTGTTAAAGT TAACGATGAA GTTGAAATCG TTGGTATCCG 450 

10 TGATGACATT CAAAAAGCTG TTGTTACTGG TGTTGAAATG TTCCGTAAAC 500 

AATTGGATGA AGGTATTGCA GGGGATAATG TTGGTGTTCT CCTTCGTGGT 550 

ATCCAACGTG ATGAAATCGA ACGTGGTCAA GTTCTTGCTA AACCAGGTTC 600 

AATTCACCCA CATACTAAAT TCAAAGGTGA AGTTTATATC CTTACTAAAG 650 

AGGAAGGTGG ACGTCATACA CCATTCTTCA ATAACTATCG TCCACAATTC 700 

15 TACTTCCGTA CAACTGACGT AACTGGTTCA ATTGAGTTGC CAGCAGGTAC 750 

TGAAATGGTT ATGCCTGGTG ATAACGTTAC TATTGACGTT GAATTGATCC 800 

ATCCAATCGC TGTTGAACAA GGTACTA 827 



20 



2) INFORMATION FOR SEQ ID NO: 225 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 824 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL* SOURCE : 

(A) ORGANISM: Streptococcus parasanguinis 

(B) - STRAIN: ATCC 15912' 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225 

AGCTATCCTT GTAGTAGCTT CAACTGACGG ACCAATGCCA CAAACACGTG 50 

AACACATCCT TCTTTCACGT CAGGTTGGTG TTAAACACTT GATCGTCTTC 100 

ATGAACAAAG TTGACTTGGT TGATGATGAA GAATTGCTTG AATTGGTTGA 150 

40 AATGGAAATC CGTGACCTTC TTTCAGAATA CGATTTCCCA GGTGATGACC 200 

TTCCAGTTAT CCAAGGTTCA GCTCTTAAAG CTCTTGAAGG TGACTCTAAA 250 

TATGAAGATA TCATCATGGA ATTGATGGAT ACTGTTGATG AGTACATCCC 300 

AGAACCAGAA CGCGATACTG ACAAACCATT GCTTCTTCCA GTCGAAGACG 350 

TATTCTCAAT CACTGGACGT GGTACAGTTG CTTCAGGACG TATCGACCGT 400 

45 GGTGTTGTTC GTGTCAATGA TGAAATCGAA ATCGTTGGTA TCAAAGAAGA 450 

AATCCAAAAA GCAGTTGTTA CTGGTGTTGA AATGTTCCGT AAACAACTTG 500 

ACGAAGGTCT TGCAGGGGAT AACGTTGGTG TGCTTCTTCG TGGTATCCAA 550 

CGTGATGAAA TCGAACGTGG ACAAGTTATC GCTAAACCAG GTTCAATCAA 600 

CCCACACACT AAATTCAAAG GTGAAGTTTA CATCCTTACT AAAGAAGAAG 650 

50 GTGGACGTCA TACTCCATTC TTCAACAACT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTAACTGG ATCTATCGAA CTTCCACCAG GAAC TGAAAT 750 

GGTAATGCCT GGTGATAACG TGACTATCGA CGTTGAGTTG ATCCACCCAA 800 

TCGCCGTTGA ACAAGGTACT ACAT 824 



2) INFORMATION FOR SEQ ID NO: 226 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 824 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus ratti 

(B) STRAIN: ATCC 19645 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226 

TGGTGCTATC CTTGTAGTAG CTTCAACTGA TGGACCAATG CCGCAAACTC 50 

GTGAACACAT CTTGCTTTCA CGTCAAGTTG GTGTTAAATA CCTTATCGTC 100 

15 TTCATGAACA AGGTTGACTT GGTTGATGAT GAAGAATTGC TTGAATTGGT 150 

TGAAATGGAA ATCCGTGATC TTCTTTCAGA ATACGATTTC CCAGGTGATG 200 

ACATTCCAGT TATCCAAGGT TCAGCCCTTA AAGCTCTTGA AGGTGACACT 250 

GAACAAGAAG ATGTTAT CAT GGAATTGATG AAAACAGTTG ATGAGTACAT 300 

CCCAGATCCA GAACGCGATA CTGATAAGCC ATTGCTTCTT CCAGTCGAAG 350 

20 ACGTGTTCTC AATCACTGGA CGTGGTACTG TTGCATCAGG ACGTATCGAC 400 

CGTGGTACTG TTAAAGTCAA TGACGAAGTT GAAATCGTTG GTATCCGTGA 450 

TGACATCCAA AAAGCTGTTG TTACTGGTGT TGAAATGTTC CGTAAACAGC 500 

TTGACGAAGG TCTTGCTGGT GATAACGTTG GTGTACTTCT TCGTGGTATC 550 

CAACGTGATG AAATCGAACG CGGTCAAGTT CTTGCTAAAC CAGGTTCAAT 600 

25 TCATCCGCAT ACTAAATTTA AAGGTGAAGT TTACATCCTT ACTAAAGAAG 650 

AAGGCGGACG TCACACTCCA TTCTTCAACA ACTACCGTCC ACAGTTCTAC 700 

TTCCGTACAA CTGACGTAAC TGGTTCAATC GAATTGCCAG CAGGTACTGA 750 

AATGGTTATG CCTGGTGATA ACGTGACTAT CGACGTTGAA TTGATCCACC 800 

CAATCGCTGT TGAACAAGGT ACTA 824 

30 



2) INFORMATION' FOR SEQ ID NO: ' 227 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

10 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus sanguinis 
15 (B) STRAIN: ATCC 10556 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227 

TGTAGTAGCT TCAACTGACG GACCAATGCC ACAAACTCGT GAGCACATCT 50 

50 TGCTTTCACG TCAGGTTGGT GTTAAACACT TGATCGTCTT CATGAACAAA 100 

GTTGACTTGG TTGACGATGA AGAATTGCTT GAATTGGTTG AAATGGAAAT 150 

CCGTGACCTC TTGTCAGAAT ACGACTTCCC AGGTGACGAT CTTCCAGTTA 2 00 

TCCAAGGTTC AGCTCTTAAA GCTCTTGAAG GTGACTCTAA ATATGAAGAC 250 

ATCATCATGG AATTGATGGA CACTGTTGAT GAGTACATCC CAGAAC'CAGA 300 

55 ACGCGATACT GACAAGCCAT TGCTTCTTCC AGTCGAAGAC GTATTCTCAA 350 

TCACTGGTCG TGGTACAGTT GCTTCAGGAC GTATCGACCG TGGTATCGTT 400 

AAAGTCAACG ACGAAATCGA AATCGTTGGT ATCAAAGAAG AAATCCAAAA 450 

AGCAGTTGTT ACTGGTGTTG AAATGTTCCG TAAACAGCTT GACGAAGGTC 500 

TTGCAGGGGA CAACGTAGGT GTGCTTCTCC GTGGTATCCA ACGTGATGAA 550 

50 ATCGAACGTG GACAAGTTAT CGCTAAACCA GGTTCAATCA ACCCACACAC 600 
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TAAATTCAAG GGTGAAGTTT ATATCCTTAC 
ACACTCCATT CTTCAACAAC TACCGTCCAC 
GACGTTACAG GTTCAATCGA ACTTCCAGCA 
TGGTGATAAC GTAACAATCG ACGTTGAGTT 



TAAAGAAGAA GGCGGACGTC 
AGTTCTACTT CCGTACAACT 
GGTACTGAAA TGGTAATGCC 
GATCCACCCA ATCGC 



650 
700 
750 
795 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO; 228 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 795 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus aobrinus 

(B) STRAIN: ATCC 334 78 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228 

TGTAGTAGCT TCTACTGACG GACCAATGCC ACAAACTCGT GAACACATCT 50 

TGCTTTCACG CCAAGTTGGT GTTAAGAACC TCATCGTCTT CATGAACAAG 100 

GTTGACTTGG TTGATGATGA AGAATTGCTT GAATTGGTTG AAATGGAAAT 150 

CCGTGATCTT CTTTCAGAAT ACGATTTCCC AGGTGACGAC ATTCCTGTTG 200 

TTCAAGGTTC AGCTCTTAAG GCTCTTGAAG GTGATACAGC TGC CGAAGAC 250 

AAGATTATGG AATTGATGGA CATCGTTGAT GATTACATTC CAGAACCAAA 300 

ACGCGATACT GATAAGCCAT 'TGCTTCTCCC AGTCGAAGAC GTATTCTCAA 35.0 

TCACTGGTCG TGGTACTGTT GCTTCAGGAC GTATTGACCG TGGTACTGTT 4 00 

AAGGTTAACG ACGAAGTTGA AATCGTTGGT ATCCGTGACG ATATCCAAAA 450 

AGCAGTTGTT ACTGGAGTTG AAATGTTCCG TAAGCAATTG GACGAAGGTC 500 

TTGCTGGAGA TAACGTTGGT GTGCTTCTTC GTGGTATCCA ACGTGATGAA 550 

ATTGAACGTG GTCAAGTATT GGCTGCACCT GGTTCAATCC ACCCACACAC 600 

TAAGTTCAAG GGTGAAGTTT ACATCCTTTC TAAAGATGAA GGTGGACGTC 650 

ACACTCCATT CTTCAACAAC TACCGTCCAC AGTTCTACTT CCGTACAACT 700 

GACGTAACTG GTTCAATCGA ATTGCCAGCA GGTACTGAAA TGGTTATGCC 750 

TGGTGATAAC GTTACTATCG ACGTTGAATT GATCCACCCA ATCGC 795 



2) INFORMATION FOR SEQ ID NO: 229 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 797 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus suis 

(B) STRAIN: ATCC 43765 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229 

TGTAGTAGCT TCAACTGACG GTCCAATGCC ACAAACTCGT GAGCACATCC 50 
TTCTTTCACG TCAGGTTGGT GTTAAACACC TTATCGTCTT CATGAACAAA 100 
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GTTGACTTGG TTGACGATGA AGAATTGCTT 
CCGTGACCTT CTTTCAGAAT ACGATTTCCC 
TCCAAGGTTC AGCTCTTAAA GCTCTTGAAG 
ATCGTTATGG AATTGATGAA CACTGTTGAT 
5 ACGCGACACT GACAAAC CAT TGTTGCTTCC 
TCACTGGTCG TGGTACTGTA GCTTCAGGAC 
CGTGTCAACG ACGAAATCGA AATCGTTGGT 
AGCAGTTGTT ACTGGTGTTG AAATGTTCCG 
TTGCCGGCGA TAACGTTGGT GTGCTTCTTC 
10 ATCGAACGTG GTCAAGTTAT CTCTAAACCA 
TAAATTCAAA GGTGAAGTTT ACATCCTTAC 
ACACTCCATT CTTCGACAAC TACCGTCCAC 
GACGTAACTG GTTCAATCAA ATTGCCAGAA 
TGGTGATAAC GTTACTATCG ACGTTGAATT 

15 



GAGTTGGTTG AAATGGAAAT 150 

AGGTGATGAT CTTCCAGTTA 2 00 

GTGACTCTAA GTACGAAGAC 250 

GAGTACATTC CAGAACCAGA 3 00 

AGTCGAGGAC GTATTCTCAA 350 

GTATCGACCG TGGTACTGTT 4 00 

CTTCAAGAAG AAAAATCTAA 450 

TAAACAACTT GACGAAGGTC 500 

GTGGTGTACA ACGTGATGAA 550 

GGTTCTATCA ACCCACACAC 600 

TAAAGAAGAA GGTGGACGTC 650 

AGTTCTACTT CCGTACAACT 700 

GGTACTGAAA TGGTAATGCC 750 

GATCCACCCA ATCGCCG 797 



2) INFORMATION FOR SEQ ID NO: 230 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 793 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



( i i ) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus uberis 

30 (B) STRAIN:' ATCC 19436 

#* 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: .230 

' * s— * ' f 

TTGTTGTTGC ATCAACTGAT GGACCAATGC CACAAACTCG TGAGCACATC 50 

35 CTTCTTTCAC GCCAAGTTGG TGTTAAACAC CTTATCGTTT TCATGAACAA 100 

AATCGACCTT GTTGACGATG AAGAATTGCT TGAATTAGTT GAAATGGAAA 150 

TCCGTGACCT TCTTTCAGAA TACGATTTCC CAGGTGATGA CCTACCAGTT 200 

ATCCAAGGTT CAGCTCTTAA AGCTCTTGAA GGTGATTCTA AATACGAAGA 250 

CATCATCATG GAATTGATGA AAACTGTTGA TGAGTATATT CCAGAACCAG 300 

4 0 AACGTGATAC AGACAAACCA TTACTTCTTC CAGTCGAAGA CGTATTCTCA 350 

ATCACAGGTC GTGGTACTGT AGCTTCAGGA CGTATCGATC GTGGTACTGT 400 

TCGTGTCAAC GACGAAATTG AAATCGTTGG TATCAAAGAA GAAACTAAAA 450 

AAGCAGTTGT TACTGGTGTT GAAATGTTCC GTAAACAACT TGACGAAGGT 500 

CTTGCAGGAG ATAACGTAGG TATCCTTCTT CGTGGTGTTC AACGTGACGA 550 

45 AATCGAACGT GGACAAGTTA TTGCTAAACC AGGTTCAATC AACCCACACA 600 

CTAAATTCAA AGGTGAAGTT TACATCCTTT CTAAAGATGA AGGTGGACGT 650 

CATACTCCAT TCTTCAACAA CTACCGTCCT CAATTCTATT TCCGTACAAC 700 

TGACGTAACA GGTTCAATCG AACTTCCAGC TGGTACTGAA ATGGTAATGC 750 

CTGGTGATAA CGTGACAATC AGCGTTGAGT TGATCCACCC AAT 793 

50 



2 ) INFORMATION FOR SEQ ID NO: 231 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



SO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus vestibularis 

(B) STRAIN: ATCC 49124 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231 



TTGTAGTAGC 
CTTCTTTCAC 
AGTTGACTTG 
TCCGTGACCT 
ATCCAAGGTT 
CATCATCATG 
AACGTGACAC 
ATCACTGGTC 
TCGTGTTAAT 
AAGCAGTTGT 
ATTGCCGGAG 
AATTGAACGT 
CTAAATTCAA 
CACACTCCAT 
TGACGTAACA 
CTGGTGATAA 



ATCTACTGAC 
GTCAGGTTGG 
GTTGACGATG 
TCTTTCAGAA 
CAGCTCTTAA 
GACTTGATGA 
TGACAAACCA 
GTGGTACTGT 
GACGAAGTTG 
TACTGGTGTA 
ATAACGTCGG 
GGTCAAGTAT 
AGGTGAAGTT 
TCTTCAACAA 
GGTTCAATCG 
CGTGACTATC 



GGACCAATGC 
TGTTAAACAC 
AAGAATTGCT 
TACGATTTCC 
AGCTCTTGAA 
ACACTGTTGA 
TTGTTGCTTC 
TGCTTCAGGA 
AAATCGTTGG 
GAAATGTTCC 
TGTCCTTCTT 
TGGCTGCACC 
TACATCCTTT 
CTACCGTCCA 
AACTTCCTGC 
GACGTTGAGT 



CACAAACTCG 
CTTATCGTCT 
TGAATTGGTT 
CAGGTGATGA 
GGTGATTCTA 
CGAATACATT 
CAGTCGAAGA 
CGTATCGACC 
TCTTAAAGAA 
GTAAACAACT 
CGTGGTATCC 
TGGTTCAATC 
CTAAAGAAGA 
CAGTTCTACT 
AGGTACTGAA 
TGATCCACCC 



TGAGCACATC 
TCATGAACAA 
GAAATGGAAA 
TATTCCAGTT 
AATACGAAGA 
CCAGAACCAG 
CGTATTCTCA 
GTGGTGTTGT 
GAAATCCAAA 
TGACGAAGGT 
AACGTGATGA 
AACCCACACA 
AGGTGGACGT 
TCCGTACAAC 
ATGGTTATGC 
AATCGCCG 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
798 



2) INFORMATION FOR SEQ ID NO: 232 

(i) SEQUENCE CHARACTERISTICS : 

(A) : LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) ? * STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tatumella ptyseos 

(B) STRAIN: ATCC 33301 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232 

GGCGCTATCC TGGTTGTTGC TGCAACTGAC GGCCCTATGC CTCAGACCCG 50 

TGAGCACATC CTGCTGGGCC GCCAGGTAGG CGTTCCTTAC ATCATCGTGT 100 

TCCTGAACAA ATGTGACATG GTTGATGATG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TCCGTGACCT GCTGTCACAG TACGACTTCC CGGGTGACGA 200 

CACGCCAATC GTTCGCGGTT CAGCGCTGAA AGCACTGGAA GGTGAAGGCG 250 

AGTGGGAAGA GAAGATTCTG GAGCTGGCTG GCTTCCTGGA TTCTTACATC 3 00 

CCTGAGCCAG AGCGTGCTAT CGATCAGCCG TTCCTGCTGC CAATCGAAGA 350 

CGTATTCTCA ATCTCCGGTC GTGGTACAGT TGTTACCGGT CGTGTAGAGC 400 

GCGGGATCAT CAAAGTCGGT GAAGAAGTTG AGATCGTTGG TATCAAAGAT 450 

ACTGCGAAAT CAACCTGTAC CGGTGTTGAA ATGTTCCGTA AACTGCTGGA 500 

CCAGGGTCAG GCGGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAGC 550 

GTGAAGAGAT CGAACGTGGT CAGGTTCTGG CTAAACCAGG TTCAATCAAA 600 

CCACACACCC AGTTCGAGTC AGAAGTTTAT ATTCTGTCTA AAGACGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

GTACAACTGA CGTGACCGGA ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

GTAATGCCTG GTGACAACAT CAAAATGGTT GTTACCCTGA TCCATCCAAT 800 

CGCGATGGAC GATGGTCTGC GTTTCGCAA 829 
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2) INFORMATION FOR SEQ ID NO: 233 

5 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Trabulsiella guamensis 

(B) STRAIN: ATCC 49490 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233 

20 GGCGCAATCC TGGTAGTAGC AGCGACTGAC GGCCCGATGC CGCAGACTCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 150 

GAGATGGAAG TTCGTGAACT GCTGTCTCAG TACGATTTCC CGGGCGATGA 200 

CACGCCGATC GTACGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG 250 

25 AGTGGGAAGC GAAAATCATC GAACTGGCAG GTTTCCTGGA TTCTTACATT 3 00 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 350 

CGTATTCTCC ATCTCTGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

GCGGTATCAT CAAAGTGGGT GAAGAAGTAG AAATCGTTGG TATCAAAGAG 450 

ACTGCGAAGT CAACCTGTAC TGGCGTAGAA ATGTTCCGCA AACTGCTGGA 500 

30 CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAC 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGG 650 

CGGCCGTCAC ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTACAACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

35 GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800 

CGCGATGGAC GACGGTCTGC GTTTCGCAA 829 



40 2) INFORMATION FOR SEQ ID NO: 234 

( i ) SEQUENCE CHARACTER I STICS : 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Veillonella parvula 

(B) STRAIN: ATCC 10790 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234 



CGGCGCTATC TTGGTTGTAT CCGCAGCTGA CGGCCCTATG CCTCAAACTC 50 

GCGAACACAT CTTGTTGGCT CGCCAAGTTG GTGTTCCTGC AATCGTAGTA 100 

TTCTTGAACA AAGCTGACAT GGTTGACGAT GAAGAATTGA TCGAATTGGT 150 

AGAAATGGAA GTTCGTGAAC TTCTTTCTTC CTACGAATTC CCTGGCGACG 2 00 

60 AAGTACCTAT CGTTGTAGGT TCCGCGTTGA AAGCTTTGGA AGGCGATGCT 250 
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CCCAACACCA GTTCGTGACA CTGATAAACC 
ATGTTTTCAC AATCACTGGT CGTGGTACAG 
CGTGGTCAAG TAAACGTTGG TGATACTGTT 
AAAAGCTGAA CAATACGTAG TAACAGGTCT 
TGGATTCTGC AGTAGCAGGT GACAACGTAG 
GATCGTAAAG ACATYGAACG TGGTCAAGTA 
CAACCCACAY ACAAAATTCA AAGCAGAAGT 
AAGGTGGTCG TCATACTCCA TTCTTCTCCA 
TTCCGTACAA CAGACGTAAC AGGTGTTGTA 
AATGTGTATG CCTGGCGATA ACGTAACAAT 
CAATCGCTAT CGAAGAAGGT CTTCG 
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GACGCTGTAG ACTCCTACAT 3 00 

ATTCTTGATG CCTGTGGAAG 350 

TAGCAACTGG CCGTGTTGAA 4 00 

GAAGTAGTAG GCTTGAAAGA 4 50 

TGAAATGTTC CGTAAAGTGT 500 

GTGCATTGCT TCGTGGTGTT 550 

TTGGCTAAAC CAGGTTCCAT 600 

ATACGTATTG ACTAAAGAAG 650 

ACTACCGTCC ACAATTCTAC 700 

AACCTTCCTG AAGGTGTAGA 750 

GGAAATCGAA TTGATTACTC 800 

825 



2) INFORMATION FOR SEQ ID NO: 235 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

( i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia enterocolitica 

(B) STRAIN: ATCC 9610 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235 

CGCTATCCTG GTTGTTGCTG CAACTGATGG CCCAATGCCA CAGACTCGTG 50 

AGCACATCCT GTTGGGTCGT 7 CAGGTTGGTG TTCCTTACAT CATCGTATTC 100 

ATGAACAAAT GTGACATGGT TGACGATGAA GAGCTGCTAG AACTGGTAGA 150 

AATGGAAGTG CGCGATCTTC TGTCTACCTA CGATTTCCCA GGCGATGATA 2 00 

CGCCAGTTGT TCGTGGTTCC GCGCTGAAAG CATTGGAAGG CGAACCTGAG 250 

TGGGAAGCAA AAATTATCGA ACTGGCTGGC TACCTGGATT CTTACATCCC 3 00 

AGAACCAGAG CGTGCTATCG ATAAGCCGTT CCTGCTGCCA ATCGAAGACG 3 50 

TATTCTCTAT CTCTGGTCGT GGTACTGTTG TAACGGGTCG TGTAGAGCGC 4 00 

GGTATCGTTA AAGTTGGTGA AGAAGTCGAA ATTGTTGGCC TGAAAGATAC 450 

CGTTAAATCT ACTTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGATG 500 

AAGGCCGTGC AGGTGAGAAC GTTGGTGTTC TGCTGCGTGG TATCAAGCGT 550 

GAAGATATCG AACGTGGTCA AGTTCTTGCT AAACCAGGTT CGATTAAACC 600 

ACACACCAAA TTTGAATCAG AAGTTTATAT TCTGAGCAAA GATGAAGGTG 650 

GTCGCCATAC TCCGTTCTTC AAAGGCTACC GTCCTCAGTT CTACTTCCGT 700 

ACAACTGATG TAAC CGGTAC TATTGAACTG CCAGAAGGCG TTGAGATGGT 750 

GATGCCAGGT GATAACATTC AAATGATTGT TAACCTGATT GCTCCTATCG 800 

CAATGGATGA CGGCTTGCGC TTTGC 825 



2) INFORMATION FOR SEQ ID NO: 236 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DN A 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia frederiksenii 

(B) STRAIN: ATCC 33641 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236 



GGCGCGATCC 
CGAGCACATT 
TCCTGAACAA 
GAAATGGAAG 
CACTCCAGTT 
AGTGGGAAGC 
CCACAGCCAG 
CGTATTCTCA 
GCGGTATCGT 
ACCATCAAGA 
CGAAGGCCGT 
GTGATGACGT 
CCACACACCA 
TGGTCGCCAT 
GTACAACTGA 
GTGATGCCAG 
CGCAATGGAT 



TGGTTGTTGC 
CTGTTAGGGC 
ATGTGACATG 
TTCGTGAACT 
ATCCGTGGTT 
AAAAATCATC 
AGCGTGCGAT 
ATCTCTGGCC 
TAAAGTCGGC 
CTACCTGTAC 
GCGGGTGAGA 
ACAACGTGGT 
AATTTGAATC 
ACTCCGTTCT 
CGTGACCGGT 
GTGATAACAT 
GACGGTCTGC 



TGCCACTGAT 
GTCAGGTGGG 
GTTGACGACG 
TCTGTCTCAG 
CTGCGCTGAA 
GAATTGGCTG 
TGATAAACCA 
GTGGTACTGT 
GAAGAAGTCG 
TGGTGTTGAA 
ACGTTGGTGT 
CAGGTATTGG 
AGAAGTTTAT 
TCAAAGGTTA 
ACTATCGAAC 
TCAAATGATT 
GCTTTGCG 



GGCCCGATGC CACAGACTCG 
TGTTCCTTAC ATCCTGGTCT 
AAGAGCTGCT GGAACTGGTA 
TACGATTTCC CTGGCGACGA 
AGCGCTGGAA GGCGAAGCTG 
AGGCGCTGGA TAGCTATATT 
TTCCTGCTGC CAATCGAAGA 
TGTCACCGGT CGTGTAGAGC 
AAATCGTTGG TATCATTGAT 
ATGTTCCGCA AATTGCTGGA 
TCTGCTACGT GGTACTAAAC 
CAAAACCAGG TTCTATCAAG 
ATTCTGAGCA AAGATGAAGG 
TCGTCCTCAG TTCTACTTCC 
TGCCAGAAGG CGTTGAGATG 
GTTAACCTGA TTGCTCCTAT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
828 



2 ) INFORMATION FOR SEQ ID NO: 237 

( i ) SEQUENCE CHARACTERISTICS : 

(A) - LENGTH: 813 bases 

(B) TYPE: Nucleic acid 

' (C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia intermedia 

(B) STRAIN: ATCC 29909 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237 

CTTGGTTGTC GCTGCAACTG ATGGTCCTAT GCCACAGACT CGCGAGCACA 50 

TCCTGCTAGG TCGTCAGGTG GGTGTTCCTT ACATCCTGGT CTTCCTGAAC 100 

AAGTGTGACA TGGTTGACGA TGAGGAGTTG CTGGAATTGG TAGAAATGGA 150 

AGTCCGCGAA CTTCTGTCTC AATATGATTT CCCTGGCGAT GATACTCCTG' 200 

TTATCCGTGG TTCAGCGCTG AAGGCGTTGG AAGGCGAGCC TGAATGGGAA 250 

GCAAAAATTA TCGAATTAGC TGAGGCGCTG GATAGTTATA TTCCACAGCC 3 00 

AGAGCGCGCG ATTGATAGAC CATTCTTGCT GCCAATCGAA GACGTATTCT 350 

CTATCTCAGG TCGTGGTACA GTCGTCACTG GTCGTGTAGA GCGTGGGATC 4 00 

GTTAAAGTTG GCGAAGAAGT TGAAATCGTT GGTATTATCG ATTCCATTAG 450 

AACAACATGT ACTGGCGTTG AAATGTTCCG CAAATTGCTG GACGAAGGCC 500 

GCGCGGGTGA GAACGTTGGT GTTCTACTGC GTGGGACTAA ACGTGATGAC 550 

GTACAGCGTG GTCAGGTATT AGCTAAGCCA GGTTCTATCA AGCCACATAC 600 

TAAATTCGAA TCCGAAGTTT ATATTCTGAG CAAAGATGAA GGCGGGCGTC 650 

ACACGCCGTT CTTCAAAGGC TACCGTCCTC AGTTCTACTT CCGTACAACG 700 

GATGTAACCG GTACTATTGA ATTGCCAGAC GGCGTTGAGA TGGTGATGCC 750 

AGGTGATAAC ATTCAAATGA TTGTTAACCT GATTGCACCT ATTGCGATGG 800 

ATGATGGTCT GCG 813 
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2) INFORMATION FOR SEQ ID NO: 238 

5 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Yersinia pestis 

(B) STRAIN: KIM D27 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238 

20 GGAGCGATCT TGGTTGTTGC TGCAACCGAT GGCCCTATGC CGCAGACTCG 50 

TGAGCATATC CTGCTGGGCC GCCAGGTTGG TGTCCCATAC ATTATTGTCT 100 

TCCTGAACAA ATGTGACATG GTTGACGATG AAGAGTTGCT AGAGTTGGTT 150 

GAAATGGAGG TTCGTGAGCT TCTGTCTCAA TACGATTTCC CAGGCGACGA 200 

CACTCCAGTC ATCCGTGGTT CAGCGTTGAA AGCCCTGGAA GGTGACGCTG 250 

25 AGTGGGAAGC TAAAATTATC GAGTTGGCAG AAGCTCTGGA TAGCTATATT 300 

CCGCAACCAG AACGCGCTAT TGATAGACCA TTCCTATTGC CAATTGAAGA 350 

CGTATTCTCT ATTTCTGGTC GTGGTACTGT AGTTACTGGT CGTGTAGAAC 400 

GTGGTATTGT TAAGGTCGGC GAAGAAGTTG AAATCGTTGG TATTATCGAT 450 

ACGATTAAAA CAACTTGTAC TGGCGTTGAA ATGTTCCGCA AGCTGCTGGA 500 

3 0 TGAAGGCCGT GCTGGTGAAA ATGTTGGTGT TCTGCTGCGT GGTACTAAGC 550 

GTGACGATGT TCAGCGTGGT CAAGTACTGG CGAAACCAGG TTCTATCAAG 600 

CCACACACGA AGTTTGAGTC AGAAGTTTAT ATTCTGAGCA AAGATGAAGG 650 

CGGCCGTCAT ACACCGTTCT TCAAGGGCTA CCGTCGTCAG " TTCTACTTCC 700 

GTACAACTGA CGTGACCGGT ACCATTGAGC TGCCAGAAGG CGTTGAAATG 750 

35 GTCATGCCTG GTGACAACGT AAACATGGTT GTTAAC CTAA TTGCTCCTAT 800 

CGCAATGGAT GATGGTCTGC GCTTCGCAA 829 



40 2) INFORMATION FOR SEQ ID NO: 239 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia pseudotuberculosis 

(B) STRAIN: ATCC 29833 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 9 



TGGAGCGATC TTGGTTGTTG CTGCAACCGA TGGCCCTATG CCGCAGACTC 50 

GTGAGCATAT CCTGCTGGGC CGCCAGGTTG GTGTCCCATA CATTATTGTC 100 

TTCCTGAACA AATGTGACAT GGTTGACGAT GAAGAGTTGC TAGAGTTGGT 150 

TGAAATGGAG GTTCGTGAGC TTCTGTCTCA ATACGATTTC CCAGGCGACG 200 

0 ACACTCCAGT CATCCGTGGT TCAGCGTTGA AAGCCCTGGA AGGTGACGCT 250 
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GAGTGGGAAG 
TCCGCAACCA 
ACGTATTCTC 
CGCGGTATTG 
TACGATTAAA 
ATGAAGGCCG 
CGTGACGATG 
GCCACACACG 
GCGGCCGTCA 
CGTACAACTG 
GGTCATGCCT 
TCGCAATGGA 



CTAAAATTAT 
GAACGCGCTA 
TATTTCTGGT 
TTAAGGTCGG 
ACAACTTGTA 
TGCTGGTGAA 
TTCAGCGTGG 
AAGTTTGAGT 
TACACCGTTC 
ACGTGACCGG 
GGTGACAACG 
TGATGGT 



CGAGTTGGCA 
TTGATAGACC 
CGTGGTACTG 
CGAAGAAGTT 
CTGGCGTTGA 
AATGTTGGTG 
TCAAGTACTG 
CAGAAGTTTA 
TTCAAGGGCT 
TACCATTGAG 
TAAACATGGT 



GAAGCTCTGG ATAGCTATAT 
ATTCCTATTG CCAATTGAAG 
TAGTTACTGG TCGTGTAGAA 
GAAATCGTTG GTATTATCGA 
AATGTTCCGC AAGCTGCTGG 
TTCTGCTGCG TGGTACTAAG 
GCGAAACCAG GTTCTATCAA 
TATTCTGAGC AAAGATGAAG 
ACCGTCCTCA GTTCTACTTC 
CTGCCAGAAG GCGTTGAAAT 
TGTTAACCTA ATTGCTCCTA 



300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
817 



2) INFORMATION FOR SEQ ID NO: 240 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
-(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia rohdei 

(B) STRAIN: ATCC 43380 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240 

TGGCGCGATC CTGGTTGTTG CTGCAACTGA TGGCCCAATG CCACAGACTC 50 

GCGAGCACAT CCTGTTGGGT CGTCAAGTGG GTGTTCCTTA CATCTTAGTC 100 

TTCCTGAACA AGTGTGACAT GGTTGACGAC GAAGAGTTGC TGGAACTGGT 15.0 

TGAAATGGAA GTTCGTGAGC TTCTGTCTCA ATACGATTTC CCTGGCGATG 2 00 

ACACTCCGGT TATTCGTGGT TCCGCGCTGA AAGCGCTGGA AGGCGAGGCC 250 

GAGTGGGAAG CCAAAATTAT TGAACTTGCT GAAGCACTGG ATAGCTACAT 300 

TCCACAGCCA GAGCGCGCGA TTGATAAACC ATTCTTGCTG CCAATCGAAG 350 

ACGTATTCTC TATCTCAGGC CGTGGAACAG TTGTTACCGG GCGTGTTGAG 4 00 

CGCGGTATCG TCAGAGTGGG CGAAGAAGTT GAAATCGTGG GTATCATCGA 450 

CACCATTAAA ACCACTTGTA CCGGTGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCCGGTGAG AACGTTGGTG TTCTGCTGCG CGGTACTAAA 550 

CGCGATGACG TGCAACGTGG TCAAGTGTTG GCTAAACCAG GTTCTATTAA 600 

GCCGCATACC AAATTTGAGT CAGAAGTTTA TATTCTGAGC AAAGATGAAG 650 

GTGGTCGTCA TACTCCGTTC TTCAAAGGTT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTGACCGG TACCATCGAA CTGCCAGACG GTGTTGAGAT 750 

GGTGATGCCA GGTGATAACA TTCAAATGAT TGTTAACCTG ATTGCGCCTA 8 00 

TTGCAATGGA TGACGGTCTA CGATTTGCA 829 



2) INFORMATION FOR SEQ ID NO: 241 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 804 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yokenella regensburgei 

(B) STRAIN: ATCC 35313 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241 

TGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

10 TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

AGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCGGGCGACG 2 00 

ACACTCCGAT CATCCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGAAGCA 250 

GAGTGGGAAG CTAAAATCGT TGAGCTGGCT GGCTTCCTGG ATTCTTACAT 300 

CCCAGAACCA GTTCGTGCTA TCGACCTGCC GTTCCTGCTG CCGATCGAAG 350 

15 ACGTATTCTC CATCTCCGGT CGTGGCACCG TTGTTACCGG TCGTGTAGAG 400 

CGCGGTATCG TTAAAGTTGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTGCTAAG TCTACCTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GTTCCATCAA 600 

20 GCCGCACACC AAATTCGAAT CTGAAGTTTA TATCCTGTCC AAAGACGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACAACTG ACGTGACTGG TACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGC 804 

25 



2) INFORMATION FOR SEQ ID NO: 242 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 849 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : :Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Achromobacter xylosoxidans subsp. 
4 0 deni tri fi cans 

(B) STRAIN: ATCC 15173 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242 

45 TCAGTTCCCC CGCGATCACA TGCCCAAGAT CTACGAAGCG CTTACTCTGG 50 

CCGACGAGGG TTCCTCGTTC GCCGAAAAGG GTCTGACGCT GGAAGTGCAG 100 

CAACAGCTGG GCGACGGCGT GGTGCGTACC ATCGCGCTGG GCTCCAGCGA 150 

CGGCCTGCGC CGCGGTATGA AGGTCACCGG TACGGGCGCG CCGATCTCGG 200 

TGCCGGTCGG CACCGGCACG CTGGGCCGCA TCATGGACGT GCTGGGTCGT 250 

50 CCCATCGACG AAGCCGGCCC GATCCAGCAC GAAGAAAAGC GTGGCATTCA 3 00 

CCAGCCGGCT CCCCGTTTCG ACGAACTGTC GCCGTCGGTG GAACTGCTGG 350 

AAACCGGCAT CAAGGTTATT GACCTGGTCT GCCCGTTCGC CAAGGGCGGC 400 

AAGGTCGGCC TGTTCGGCGG CGCCGGCGTG GGCAAGACCG TCAACATGAT 450 

GGAACTGATC AACAACATCG CCAAGCAGCA CAGCGGCTTG TCGGTGTTCG 500 

55 CCGGCGTGGG CGAGCGTACC CGCGAAGGCA ACGACTTCTA CCACGAAATG 550 

GAAGAGTCGA ACGTTCTGGA CAAGGTTGCG ATGGTGTTCG GTCAGATGAA 600 

CGAACCCCCG GGCAACCGTC TGCGCGTGGC GCTGACCGGC CTGACCATGG 650 

CCGAGAAGTT CCGCGACGAA GGCCGCGACA TCCTGTTCTT CGTGGACAAC 700 

ATCTACCGCT ACACCCTGGC CGGTACGGAA GTGTCCGCGC TGCTGGGCCG 750 

50 TATGCCGTCG GCAGTGGGCT ACCAGCCCAC GCTGGCCGAA GAAATGGGCA 800 
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AGCTGCAAGA GCGCATCACC TCGACCAAGA CCGGCTCGAT CACCTCGA^ 849 



5 2) INFORMATION FOR SEQ ID NO: 243 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 787 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: AcinetoJbacter jbaumannii 

(B) STRAIN: ATCC 19606 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243 



20 



TGAAACTACT TTAGAAGTTC AGCAACAACT TGGTGATGGT GTTGTTCGTA 50 

CCATCGCAAT GGGTTCTACA GAAGGTCTTA AACGTGGTCT TACTGTAACT 100 

AGCACAAACG CACCGATCTC TGTTCCAGTT GGTACAGCCA CTCTTGGCCG 150 

TAT CATGGAC GTTTTAGGTC GTCCTATCGA TGAAGCAGGT CCTGTTGCGA 200 

25 CTGAAGAACG TTTGCCGATT CACCGTCAAG CGCCTTCTTA TGCTGAACAA 250 

GCAGCTTCTA CTGACCTTTT AGAAACTGGT ATTAAAGTCA TCGACTTACT 3 00 

TTGCCCGTTT GCGAAAGGTG GTAAAGTTGG TTTATTCGGT GGTGCTGGTG 350 

TTGGTAAAAC CGTTAACATG ATGGAATTGA TCAACAACAT CGCGAAAGCA 40.0 

CACTCAGGTT TATCTGTGTT TGCTGGTGTT GGTGAGCGTA CTCGTGAAGG 450 

30 TAATGACTTC TATCACGAAA TGAAAGATTC TAACGTTCTT GACAAAGTAG 500 

CAATGGTCTA CGGTCAGATG AACGAGCCAC CAGGTAACCG TTTACGCGTA 550 

GCGTTAACTG GTTTGACTAT GGCTGAATAC TTCCGTGATG AAAAAGACGA 60,0 

AAACGGTAAA GGTCGTGACG TATTATTATT CGTCGACAAC ATCTACCGTT 650 

ATACAC TTGC AGGTACTGAA GTATCAGCAT TGTTAGGTCG TATGCCATCT 700 

35 GCGGTAGGTT ACCAACCTAC ACTTGCAGAA GAAATGGGTG TTCTTCAAGA 750 

GCGTATTACA TCTACTAAAT CTGGTTCGAT CACTTCG 787 



10 2) INFORMATION FOR SEQ ID NO: 244 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 
i5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobacter lwoffii 

(B) STRAIN: CDCF 3697 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244 

GTTCCTAAGA TCTATGACGC TCTCCACGTT GATGGCACTG AAACTACATT 50 

AGAAGTTCAG CAACAACTTG GTGATGGCGT AGTTCGTACT ATTGCAATGG 100 

GTTCTACTGA AGGCCTTAAG CGTGGTTTGA ACGTAAC TAA CACTAACGCG 150 

CCGATTTCTG TACCAGTAGG TACAGCGACT CTAGGTCGTA TCATGGACGT 200 

TCTTGGTCGC CCAATCGACG AAGCTGGTCC AGTTGCGACT GAAGCGCGTT 250 
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TGCCGATTCA CCGTCAAGCA CCTTCTTATG 
GACCTTTTAG AAACTGGTAT TAAAGTCATC 
TAAAGGTGGT AAAGTTGGTC TGTTCGGTGG 
TAAACATGAT GGAGTTGATC AACAACATCG 
5 TCTGTATTCG CTGGTGTTGG TGAGCGTACT 
TCACGAGATG AAAGACTCAA ACGTTCTAGA 
GTCAGATGAA CGAGCCACCG GGTAACCGTT 
TTGACCATGG CTGAGTACTT CCGTGACGAG 
CCGTGACGTA CTATTGTTCG TAGATAACAT 
10 GTACTGAAGT ATCAGCACTT CTAGGTCGTA 
CAACCGACAC TTGCAGAAGA GATGGGTGTT 
GACTAAGTCT GGTTCGATTA CGTCA 
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CTGAACAAGC AGCTTCTACT 300 

GACTTACTTT GCCCGTTCGC 350 

TGCCGGTGTT GGTAAAACTG 400 

CTAAAGCGCA CTCAGGTTTA 450 

CGTGAAGGTA ATGACTTCTA 500 

CAAAGTAGCA ATGGTCTACG 550 

TACGCGTAGC GTTGACTGGT 600 

AAAGACGAAA ACGGCAAAGG 650 

CTACCGTTAT ACACTAGCAG 700 

TGCCGTCTGC AGTAGGTTAC 750 

CTTCAAGAAC GTATTACATC 800 

825 



15 



25 



50 



2 ) INFORMATION FOR SEQ ID NO: 245 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 837 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) ' STRAIN: CSG-197 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245 

CAATGAAGTT -CCAGAAATTA ACAATGCCTT AGTCGTAGAC GTTGAAAGAG 50 

ATGAAGGTAC AGTATCTCTT ACATTAGAAG TGGCATTACA ACTTGGCGAT 100 

GATGTCGTAC GTACAATTGC AATGGATTCT ACTGATGGTG TTAAACGTGG 150 

35 TACAGAAGTT CGAGATAGCG GAGATAGCAT CAGTGTTCCA GTTGGTGATG 200 

CTACGTTAGG ACGTGTGTTT AATGTTCTTG GTGATACAAT TGACTTAGAC 250 

GAGAAGCTTG ATACTTCTGT CAAACGTGAT CCAATTCATA GAGAAGCACC 300 

TGCATTCGAT CAATTATCAA CAAAAGTTGA AATCTTAGAA ACAGGTATTA 350 

AAGTAATTGA TTTACTTGCA CCATATATTA AAGGTGGTAA AATCGGTTTA 400 

40 TTCGGTGGCG CTGGTGTAGG TAAAACAGTA TTAATTCAAG AATTAATTAA 450 

TAATATAGCT CAAGAACATG GTGGTATTTC AGTATTTGCC GGCGTAGGTG 500 

AACGTACGCG TGAAGGTAAT GACTTATACT ACGAAATGAG TGATAGTGGT 550 

GTTATTAAGA AAACAGCTAT GGTCTTCGGA CAAATGAATG AGCCACCTGG 600 

TGCGCGTATG CGTGTTGCTT TATCAGGCTT AACAATGGCT GAACACTTCC 650 

45 GTGATGTACA AGGACAAGAT GTTTTACTAT TTATTGATAA CATATTCAGA 700 

TTTACGCAAG CTGGTTCAGA AGTATCAGCA CTATTAGGTC GTATGCCATC 750 

AGCCGTTGGT TATCAACCTA CCCTTGCTAC TGAAATGGGT CAATTACAAG 800 

AACGTATTAC ATCAACAACT AAAGGATCTG TAACGTC 837 



2 ) INFORMATION FOR SEQ ID NO: 246 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) . LENGTH: 851 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alcaligenes faecalis 

(B) STRAIN: ATCC 15554 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246 

TTCCCCCGCG ACAGCATCCC TAAAGTCTAC GAAGCATTGA CGCTCGTTGA 50 

CGAAAGTTCG GCTTTCGCAG AAAAAGGCCT GACTTTTGAA GTACAGCAAC 100 

10 AATTGGGTGA CGGTGTAGTT CGCACCATCG CCATGGGTTC CAGCGACGGC 150 

CTGCGCCGCG GTATGGAAGT GGCCGGTTCG GGCGCTCCCA TCTCCGTTCC 200 

CGTGGGTGTC GGCACCCTGG GTCGCATTAT GGACGTTCTG GGTCGCCCTA 250 

TTGACGAAGT CGGTCCTATT CAGTCCGACG AGCGTCGCGC CATTCACCAG 300 

CCTGCGCCTA CTTTCGACGA ACTGTCGCCT TCCGTAGAGC TGCTGGAAAC 350 

15 CGGTATTAAA GTGATTGACC TGGTTTGCCC GTTCGCCAAG GGTGGTAAGG 400 

TTGGTCTGTT CGGTGGTGCC GGTGTGGGCA AGACCGTGAA CATGCTGGAG 450 

CTGATCAACA ACATCGCCAA GGCACACAGC GGTCTGTCCG TGTTTGCCGG 500 

TGTGGGTGAG CGTACCCGTG AAGGTAACGA CTTCTACCAC GAAATGGCCG 550 

ATGCTGGCGT TATCCAGATG GACAACCTGA GCGAGTCCAA AGTGGCCATG 600 

20 GTGTTCGGTC AGATGAACGA ACCTCCAGGC AACCGTCTGC GTGTGGCACT 650 

GTCCGGCCTG ACCATGGCCG AGAAGTTCCG TGACGAAGGC CGTGACATCC 700 

TGTTCTTTGT GGACAACATC TACCGCTACA CGCTGGCCGG TACAGAAGTG 750 

TCCGCTCTGC TGGGTCGTAT GCCTTCCGCA GTGGGTTACC AGCCTACGCT 800 

GGCCGAGGAA ATGGGTAAGC TGCAAGAGCG CATTACCTCC ACCAAGACCG 850 

25 G 851 



2) INFORMATION FOR SEQ ID NO: 247 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 bases 

(B) TYPE:: Nucleic acid 

(C) STRANDEDNESS: Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Bacillus anthracis 

(B) STRAIN: 4229 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247 

15 GATGGCGGAA AGCTACCAGA AATCTACAAC GCCCTTACGG TAAAACAGAG 50 

CAACGAAAAC GGAACAAGCA TTAACTTAAC ATTTGAAGTT GCACTTCATT 100 

TAGGTGATGA CACAGTTCGT ACAGTTGCAA TGTCTTCCAC AGATGGACTT 150 

GTTCGTGGCA CAGAAGTAGA AGATACTGGT AAAGCAATCT CTGTACCAGT 200 

TGGTGATGCA ACACTTGGTC GTGTATTTAA CGTATTAGGT GATGCAATTG 250 

50 ACTTAGATGG TGAGGTTCCT GCGGATGTAC GTCGTGATCC AATTCACCGT 300 

CAAGCACCTG CATTCGAAGA ATTATCTACT AAAGTAGAAA TTCTTGAAAC 350 

TGGTATTAAA GTAGTAGACT TACTTGCTCC TTACATTAAG GGTGGTAAGA 400 

TCGGTCTATT CGGTGGTGCC GGTGTAGGTA AAACGGTATT AATTCAGGAA 450 

TTAATCAATA ACATCGCACA AGAACACGGT GGTATCTCTG TATTCGCTGG 500 

55 TGTAGGTGAG CGTACTCGTG AGGGTAATGA CTTATACCAC GAAATGAGCG 550 

ATTCTGGCGT AATTAAGAAA ACTGCGATGG TATTCGGACA AATGAACGAG 600 

CCACCTGGAG CACGTCAACG TGTTGCGTTA ACAGGTTTAA CAATGGCTGA 650 

GCATTTCCGT GATGAGCAAG GACAAGATGT ACTTCTGTTC ATCGATAATA 700 

TCTTCCGTTT CACGCAAGCA GGTTCTGAAG TATCTGCCCT TCTTGGCCGT 750 

50 ATGCCATCTG CGGTAGGTTA CCAACCAACA CTTGCAACAG AAATGGGTCA 800 
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ATTACAAGAG CGTATTACAT CTACAAATAA AGGGTCTATC ACGTCT 846 



2) INFORMATION FOR SEQ ID NO: 248 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 14579 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248 

CCAGAAATCT ACAATGCCCT TACGGTAAAA CAAAGCAACG AAAACGGAAG 50 

CATGAACTTA ACATTTGAAG TTGCACTTCA TTTAGGTGAT GATACAGTTC 100 

GTACAGTTGC GATGTCTTCC ACAGATGGAC TTGTTCGTGG CACAGAAGTA 150 

GAAGATACTG GTAAAGCAAT CTCTGTACCA GTTGGTGATG CAACACTTGG 200 

ACGTGTATTC AACGTATTAG GTGATGCAAT TGACTTAGAT GGTGAACTTC 250 

CTGCGGATGT ACACCGTGAT CCAATTCACC GTCAAGCACC TGCATTCGAA 300 

GAATTATCTA CTAAAGTAGA AATTCTTGAA ACTGGTATTA AAGTAGTAGA 350 

CTTACTTGCT CCTTACATTA AGGGTGGTAA GATCGGCCTA TTCGGTGGTG 4 00 

CCGGCGTAGG TAAAACAGTA TTAATTCAGG AGTTAATCAA TAACATCGCA 450 

CAAGAGCACG GTGGTATCTC TGTATTCGCT GGTGTAGGTG AGCGTACTCG 500 

TGAGGGTAAT GACTTATACC ACGAAATGAG CGATTCTGGC GTAATCAAGA 550 

AAACTGCGAT GGTATTCGGA CAAATGAACG AGCCACCTGG AGCACGTCAA 600 

CGTGTTGCAT TAACAGGTTT AACAATGGCT GAGCATTTCC GTGATGAGCA 650 

AGGACAAGAC GTACTTCTGT TCATCGATAA CATCTTCCGT TTCACGCAAG 700 

CGGGTTCTGA AGTATCTGCC CTTCTTGGTC GTATGCCATC TGCGGTAGGT 750 

TACCAACCAA CACTTGCAAC AGAAATGGGT CAATTACAAG AGCGTATTAC 800 

ATCTACAAAT 810 



2) INFORMATION FOR SEQ ID NO: 249 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 944 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides distasonis 

(B) STRAIN: ATCC 8503 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249 

CTTTGATTAT GGGACAGAAG AAACAGTAAC CCTCCCCCGT ATCCACGACG 50 

CCATGGAGAT TTCCCGCCCA AACGGAAAGA TCTTGATCGT CGAAGTTCAG 100 

CAACACATCG GGGAAAACAC CGTCCGTACC GTAGCGATGG ATACGACCGA 150 

CGGATTGAGA CGAGGCATGG AGGCCGTGTC ATACGGAATG CCCATCACCA 200 
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CGACCAAGTC 
GCATGGCCCA 
CCTAAATTCG 
TAAAGTAATC 
TTTTCGGAGG 
AACAACATCG 
TGAACGTACA 
GTGTCATCCG 
TGGGACTTGT 
CTTGGTATTC 
CCTTATCCGG 
GGAGAAAGAA 
CCAAGCCGGT 
TAGGTTACCA 
ATCACTTCAA 



AAAGGACGTT 
GCTTACTAAA 
AGGATCTGAC 
GATTTATTGG 
AGCCGGAGTC 
CAAAGAAAAA 
CGTGAAGGAA 
ATACGGCGAG 
CGAAGATCGA 
GGGCAGATGA 
TTTGACGATC 
AAGATATATT 
TCCGAGGTTT 
ACCGACATTG 
CCAAGAAGGG 



TAATGAATGT CACCGGCGAC 
GACGGGGCTC TTCCCATCCA 
AACGACCCAA GAGGTCTTGT 
AACCTTACGC CAAAGGAGGC 
GGCAAAACGG TATTGATCAT 
CAACGGATTC TCCGTCTTCG 
ATGACCTATT GCGTGAAATG 
GAGTTTAAGA AGAGTATGGA 
TTACGATGAA TTGGCTAAGT 
ACGAACCGCC GGGCGCCCGC 
GCCGAATCCT TCCGTGACAA 
ATTCTTCATC GATAATATCT 
CCGCCTTGCT GGGGCGTATG 
GCCACGGAAA TGGGAGCTAT 
CTCCATCACC TCCG 



250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
944 



2) INFORMATION FOR SEQ ID NO: 250 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 939 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides ovatus 
. (B) ' STRAIN: ATCC 8483 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250 

TTGAGGGTAC GGATGCAGAA TTGGTGCTGC CAAGCATCCA CGACGCACTG 50 

GAGATAAAGA GGCCAAACGG CAAAATACTG GTTGTAGAAG TTCAGCAACA 100 

TATCGGCGAA AATACGGTGC GTACCGTAGC GATGGACAGT ACTGACGGAC 150 

TTCAGAGAGG CATGAAAGTG TATCCCACCG GAGGCCCGAT CACGATGCCG 2 00 

ATTGGCGAAC AGATTAAAGG ACGACTGATG AACGTAGTCG GTGATTCGAT 250 

CGACGGTATG AAAGGACTCG ACCGCAAAGG TGCATATTCC ATTCATCGCG 300 

ACCCCCCTAA GTTTGAGGAT TTGACTACTG TGCAAGAGGT GCTCTTCACA 350 

GGTATCAAAG TGATCGACCT GCTCGAACCG TATGCCAAAG GTGGTAAAAT 400 

CGGTTTGTTC GGCGGTGCCG GTGTAGGAAA GACTGTATTG ATTCAGGAAC 450 

TTATCAATAA TATCGCCAAG AAACATAATG GATTCTCTGT ATTTGCCGGA 500 

GTAGGTGAAC GTACCCGTGA AGGTAACGAC TTGCTGCGCG AAATGATTGA 550 

ATCCGGTGTA ATCCGTTACG GCGAAGCATT CAAAGAAGGA ATGGAGAAAG 600 

GTCACTGGGA TCTTTCGAAA GTGGATTATA ACGAACTGGA GAAATCGCAA 650 

GTGTCTCTGA TTTTCGGTCA GATGAACGAG CCTCCGGGCG CACGTGCCTC 700 

TGTGGCATTG TCCGGACTGA CGGTGGCGGA ATCTTTCCGC GACGCAGGAA 750 

AAGAAGGTGA GAAACGCGAT ATTCTGTTCT TTATTGATAA TATCTTCCGT 800 

TTCACGCAAG CAGGTTCAGA AGTGTCCGCC CTTTTGGGAC GTATGCCCTC 850 

CGCTGTTGGT TACCAGCCCA CGTTGGCTAC GGAAATGGGT GCGATGCAGG 900 

AACGTATCAC GTCTACCCGC AAAGGTTCTA TCACCTCCG 93 9 



2) INFORMATION FOR SEQ ID NO: 251 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 833 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leclercia adecarboxylata 

(B) STRAIN: ATCC 23216 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251 

CGAATTCCCT CAGGATGCCG TACCGCGCGT GTACGATGCT CTTGAGGTTC 50 

AGAATGGTAA TGAGAGCCTG GTGCTGGAAG TTCAGCAGCA GCTCGGCGGC 100 

GGTATTGTGC GTACCATCGC CATGGGTTCT TCCGACGGTC TGCGTCGTGG 150 

TCTGGAAGTT AAAGACCTCG AGCACCCAAT CGAAGTACCA GTAGGTAAAG 200 

CAACCCTGGG CCGTATCATG AACGTCCTGG GTCAGCCGAT CGACATGAAA 250 

GGCGACATCG GCGAAGAAGA GCGTTGGGCT ATCCACCGTT CAGCACCTTC 300 

CTATGAAGAG CTGTCCAGCT CTCAGGAACT GCTGGAAACC GGCATCAAAG 350 

TTATCGACCT GATGTGTCCG TTCGCGAAGG GCGGTAAAGT TGGTCTGTTC 400 

GGCGGTGCGG GTGTAGGTAA AACCGTAAAC ATGATGGAGC TGATCCGTAA 450 

CATCGCGATC GAGCACTCCG GTTACTCCGT GTTTGCAGGC GTGGGTGAGC 500 

GTACTCGTGA GGGTAACGAC TTCTACCACG AAATGACCGA CTCCAACGTT 550 

CTGGACAAAG TATCCCTGGT TTACGGCCAG ATGAACGAGC CACCAGGAAA 600 

CCGTCTGCGC GTTGCGCTGA CCGGCCTGAC CATGGCTGAG AAGTTCCGTG 650 

ACGAAGGTCG TGACGTACTG CTGTTCGTTG ACAACATCTA CCGTTACACC 700 

CTGGCCGGTA CGGAAGTATC CGCACTGCTG GGTCGTATGC CATCAGCAGT 750 

AGGCTACCAG CCGACCCTGG CGGAAGAGAT GGGTGTTCTG CAGGAACGTA 800 

TCACCTCTAC CAAAACCGGT TCTATCACCT CCG 833 



2) INFORMATION 1 FOR SEQ ID NO: 252 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Stenotrophomonas maltophilia 

(B) STRAIN: CDC F3338 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252 

GTGTACGACG CACTGAAGGT GGAAAACACC GAGATCACCC TCGAAGTCCA 50 

GCAGCAGCTG GGCGACGGCG TGGTGCGTAC CATCGCCCTC GGTTCCACCG 100 

ACGGCCTGAA GCGCAACCTG GTTGCCGTCA ACACCGGCCG TGGCATCTCG 150 

GTGCCGGTCG GCGCCGGCAC CCTGGGCCGC ATCATGGACG TGCTGGGCCG 2 00 

TCCGATCGAC GAAGCCGGCC CGGTCGCTGC CAGCGACAAC TGGGAAATCC 250 

ACCGCGCTGC GCCGTCGTAT GAAGACCAGT CCCCGGCCAC CGAGCTGCTG 3 00 

GAAACCGGCA TCAAGGTCAT CGACCTGATG TGCCCGTTCG CCAAGGGCGG 350 

CAAGGTCGGC CTGTTCGGCG GCGCCGGCGT CGGCAAGACC GTCAACATGA 400 

TGGAACTGAT CAACAACATC GCCAAGGCGC ACAGCGGCCT GTCCGTGTTC 450 

GCCGGCGTGG GTGAGCGTAC CCGTGAGGGC AACGACTTCT ACCACGAGAT 500 

GAAGGACTCC AACGTCCTGG ACAAGGTGGC GATGGTGTAC GGCCAGATGA 550 

ACGAGCCGCC GGGCAACCGT CTGCGCGTCG CCCTGACCGG CCTGACCATG 600 
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GCCGAGTACT TCCGCGATGA GAAGGACGAA 
CCTGCTGTTC GTCGACAACA TCTACCGCTA 
TGTCGGCACT GCTGGGCCGC ATGCCGTCCG 
CTGGCCGAGG AAATGGGCGT CCTGCAGGAG 
TGGTTCGATC ACCTCGATC 



AACGGCAAGG GCAAGGACGT 
CACCCTGGCC GGTACCGAAG 
CGGTGGGTTA CCAGCCGACC 
CGCATCACCT CGACCAAGAA 



* * 

650 
700 
750 
800 
819 



10 



15 



20 



25 



30 



35 



15 



50 



55 



2) INFORMATION FOR SEQ ID NO: 253 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 864 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bartonella henselae 

(B) STRAIN: ATCC 49882 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253 

AGTTTGAAGG CCCTTTGCCA AATATTCTCA ATGCATTAGA AACAGATAAT 50 

TTGGGCAATC GGCTAGTTTT AGAAGTTGCT CAGCATTTGG GTGAAAATAC 100 

CGTGCGTACC ATTGCCATGG ATACTACCGA TGGTCTTGTC CGTGGTCAAA 150 

AAGTTTTTGA TACAGGAACA CAGATCAGTG TTCCCGTGGG AGAAGCAACA 2 00 

CTTGGTCGTA TTATGAATGT GATTGGAGAG CCGGTTGATA ATGTTGGCCC 250 

AATTGCTACA AGCAAAACCC GTTCCATTCA CCAAGAGGCT CCTGAATATG 3 00 

TGGAGCAATC AACCGCATCA GAAATCCTTG TGACTGGTAT TAAAGTCGTT 3 50 

GATCTGTTAG CTCCTTATTC TAAAGGGGGG AAGGTTGGTT TGTTTGGAGG 400 

TGCCGGTGTT GGTAAAACCG TTCTCATTAT GGAGCTTATC AACAATATTG 450 

CAAAGGCGCA TGGTGGCTAT TCAGTGTTTG CCGGTGTTGG TGAACGTACA 500~ 

CGTGAGGGAA ATGATCTTTA TTATGAAATG ATCGAAAGCC GTGTGAATGT 550 

GAATCCAAAA GACAACAATG GTTCAACAGA AGGATCAAAA TGTGCACTCG 600 

TTTATGGGCA AATGAATGAA CCACCAGGGG CGCGTGCACG TGTGGCTCTT 650 

TCAGGATTGA CCATTGCAGA AAGTTTCCGT GATGAGGGAC AAGATGTTTT 700 

GTTCTTCGTA GATAATATTT TCCGTTTTAC GCAAGCAGGC GCTGAAGTGT 750 

CAGCTCTTTT AGGGCGTATT CCTTCTGCTG TAGGGTATCA GCCAACTTTG 800 

GCAACTGATA TGGGGGCTTT GCAAGAGCGT ATTACCAGTA CAAGAACAGG 850 

CTCTATTACC TCTG 864 



2) INFORMATION FOR SEQ ID NO: 254 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 866 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bifidobacterium adolescentis 

(B) STRAIN: ATCC 15703 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254 
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TTCCCGGTGG GCCACCTGCC CGACATTTAC AATGCACTTA CCGTTGAACT 50 

GKCCAACACC GGYGTCCACG AASAGGGCGA GACCACCAAG AAGATCACCC 100 

TTGMGGTTGA ACAGCATCTT GGCGATTCCA CCGTGCGTAC CGTCGCTCTG 150 

5 AAGCCGACTG ACGGCCTTGT GCGTGGCGCC ACCGTGTATG ACACCGGCGG 200 

CCCGATCTCT GTGCCGGTTG GCGATGTCAC CAAGGGCCAC GTATTCGACG 250 

TGTCCGGCAA CATCCTCAAC AAGAAGGCCG ACGAGACCGT TAAGGTTACC 300 

GAACGTTGGT CTATCCACCG TAACCCGCCG GCATTCGACC AGCTGGAGTC 350 

CAAGACCCAG ATGTTCGAAA CCGGTATCAA GGTCATCGAT TTGCTGACCC 400 

10 CGTATGTGCA GGGCGGCAAG ATCGGTCTGT TCGGCGGCGC AGGCGTCGGC 450 

AAGACCGTGC TGATCCAGGA AATGATTCAG CGTGTGGCTC AGAACCACGG 500 

CGGTGTGTCC GTGTTCGCAG GCGTCGGCGA GCGTACCCGT GAGGGTAACG 550 

ATCTGATCGG CGAAATGGAC GAAGCCGGCG TGCTCGAGAA GACCGCACTG 600 

GTCTTCGGCC AGATGGATGA GCAGCCGGGT ACCCGTCTGC GCGTGCCGCT 650 

15 GACCGCACTG ACCATGGCAG AGTACTTCCG TGACGTACAG AATCAGGACG 700 

TGCTGCTGTT CATCGATAAC ATCTTCCGTT TCACCCAGGC TGGTTCCGAG 750 

GTGTCCACCC TGCTCGGCCG TATGCCGTCC GCAGTGGGCT ACCAGCCGAA 800 

CCTGGCCGAT GAGATGGGCG CGCTGCAGGA GCGAATCACT TCGACCCGTG 850 

GACAC TCCAT CACCTC 866 

20 



2) INFORMATION FOR SEQ ID NO: 255 



25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 842 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brucella abortus 
35 (B) STRAIN: S2308 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255 

AAGGCCAGCT GCCGCTGATC CTGAACGCGC TTGAAGTGGA CAATCAGGGC 50 

40 CATCGTCTGG TTCTCGAAGT TGCCCAGCAC CTCGGCGAAG ACACCGTGCG 100 

CACCATCGCC ATGGACGCGA CCGAAGGTCT CGTTCGCGGT CAGGAAGCAC 150 

GCGACACTGG CGAACCGATC ATGGTGCCGG TCGGCGTCGA AACGCTTGGC 200 

CGCATCATGA ACGTCATCGG CGAGCCGGTT GACGAAGCAG GCCCCATCAA 250 

GACCAAGGCA ACCCGCGCCA TCCACCAGAA CGCGCCGGAA TATATCGAAC 300 

45 AGTCGACCGA AGCCGAAATT CTGGTCACGG GCATCAAGGT CGTCGACCTT 350 

CTGGCGCCTT ACGCCAAGGG CGGCAAGATC GGCCTCTTCG GCGGTGCAGG 400 

CGTCGGCAAG ACCGTTCTCA TCATGGAACT CATCAACAAC GTCGCCAAGG 450 

CGCACGGCGG TTATTCCGTG TTCGCAGGCG TCGGTGAGCG TACCCGTGAG 500 

GGCAACGACC TTTACCACGA AATGATCGAG TCGGGCGTGA ACAAGCTCGG 550 

50 CGGCGGCGAA GGCTCCAAGG CAGCCCTCGT TTACGGCCAG ATGAACGAAC 600 

CCCCGGGTGC CCGCGCCCGC GTTGCCCTTT CCGGTCTGAC GGTTGCTGAA 650 

AACTTCCGTG ACCAGGGCCA GGACGTTCTG TTCTTCGTGG ACAACATCTT 700 

CCGCTTCACG CAGGCAGGTT CGGAAGTGTC GGCTCTTCTC GGCCGTATTC 750 

CTTCCGCTGT GGGTTATCAG CCGACGCTGG CAACCGACAT GGGCGCCATG 800 

55 CAGGAACGCA TCACCACGAC GACCAAGGGT TCGATCACCT CG 842 



2) INFORMATION FOR SEQ ID NO : 256 

60 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cedecea davisae 

(B) STRAIN: ATCC 33431 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256 

CGAGTTCCCT CAGGACGGCG TACCGCGCGT TKATGATGCG CTTGAAGTAC 50 

AAAATAACAG CGAGCAGCTG GTGCTGGAAG TTCAGCAGCA GCTCGGCGGC 100 

GGTATCGTGC GTACCATCGC TATGGGTTCT TCCGACGGTC TGCGTCGTGG 150 

TCTGGAAGTT AAAGACCTCG AGCACCCGAT CGAAGTCCCG GTAGGTAAAG 200 

CGACCCTTGG CCGTAT CATG AACGTGCTGG GTCAGCCGAT CGACATGAAA 250 

GGCGATATCG GCGAAGAAGA CCGTTGGGCT ATTCACCGCG CTGCACCTTC 300 

CTATGAAGAG CTGTCCAGCT CTCAGGAACT GCTGGAAACC GGCATCAAAG 350 

TAATCGACCT TATCTGTCCG TTCGCTAAGG GCGGTAAAGT AGGTCTGTTC 400 

GGTGGTGCGG GCGTGGGTAA AACCGTAAAC ATGATGGAGC TTATCCGTAA 450 

CATCGCGATC GAGCACTCCG GCTACTCCGT GTTTGCGGGC GTGGGTGAGC 500 

GTACTCGTGA GGGTAACGAC TTCTATCACG AAATGACCGA CTCCAACGTT 550 

CTGGACAAAG TTGCCCTGGT TTACGGCCAG ATGAACGAGC CACCGGGTAA 600 

CCGTCTGCGC GTAGCGCTGA CCGGTCTGAC CATCGCGGAG AAATTCCGTG 650 

ACGAAGGTCG TGACGTTCTG CTGTTCGTTG ATAACATCTA CCGTTACACC 700 

CTGGCCGGTA CTGAAGTATC CGCGCTGCTG GGTCGTATGC CTTCTGCGGT 750 

AGGTTACCAG CCAACTCTGG CGGAAGAGAT GGGTGTTCTT CAGGAGCGTA 800 

TTACCTCCAC CAAGACCGGT TCCATCACCT CCG 833 f 



2) INFORMATION FOR SEQ ID NO: 257 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cedecea lapagei 

(B) STRAIN: ATCC 33432 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257 

TTCCCTCAGG ACGGCGTACC GCGCGTATAT GACGCGCTTG AGGTACAGAA 50 

TAACAGCGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA 100 

TCGTACGTAC CATCGCAATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG 150 

GAAGTGAAAG ACCTCGAGCA CCCGATCGAA GTCCCGGTAG GTAAAGCGAC 200 

TCTGGGTCGT ATCATGAACG TGCTGGGTCA GCCAATTGAT ATGAAAGGCG 250 

ACATCGGCGA AGAAGATCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAT 300 

GAAGAGCTGT CCAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT 350 

CGACCTGATT TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG 4 00 

GTGCGGGCGT AGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCCGTGTTT GCAGGCGTGG GTGAGCGTAC 500 
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GACCGACTCC AACGTTCTGG 550 

ACGAGCCGCC AGGTAACCGT 600 

GCGGAGAAAT TCCGTGACGA 650 

CATCTATCGT TATACCCTGG 700 

GTATGCCATC TGCGGTAGGT 750 

GTTCTTCAGG AGCGTATTAC 800 

829 



10 



2) INFORMATION FOR SEQ ID NO: 258 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 830 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cedecea nefceri 

(B) STRAIN: ATCC 33855 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258 

TCCCTCAGGA CGGCGTACCG CGCGTTTATG ACGCGCTTGA GGTACAGAAC 50 

AACAATGAGA AGCTGGTGCT GGAAGTTCAG CAGCAGCTCG GCGGCGGTAT 100 

CGTGCGTACC ATCGCAATGG GTTCTTCCGA CGGTCTGCGT CGTGGTCTGG 150 

3 0 TAGTAACAGA CCTCGAGCAC CCGATCGAAG TCCCGGTAGG TAAAGCGACC 200 
CTTGGCCGTA TCATGAACGT GCTGGGTCAG CCGATCGACA TGAAAGGCGA ■ 250 

CATCGGCGAA GAAGACCGTT GGGCGATTCA CCGCGCAGCA CCTTCCTACG 300 

AAGAGCTGTC CAGCTCTCAG GAATTGCTGG AAACCGGCAT CAAAGTTATC 350 

GACCTGATTT GTCCGTTCGC TAAGGGCGGT AAAGTAGGTC TGTTCGGTGG 400 

35 TGCGGGCGTA GGTAAAACCG TAAACATGAT GGAGCTGATC CGTAACATTG 450 

CGATCGAGCA CTCCGGTTAT TCCGTGTTTG CGGGCGTGGG TGAGCGTACT 500 

CGTGAGGGTA ACGACTTCTA CCACGAAATG ACCGACTCCA ACGTTCTGGA 550 

TAAAGTAGCA CTGGTTTACG GCCAGATGAA CGAGCCACCA GGTAACCGTC 600 

TGCGCGTAGC GCTGACCGGT CTGACCATCG CGGAAAAATT CCGTGACGAA 650 

40 GGCCGCGACG TTTTGCTGTT CGTTGATAAC ATCTACCGTT ATACCCTGGC 700 

CGGTACCGAA GTATCTGCAC TGCTGGGTCG TATGCCTTCT GCGGTAGGTT 750 

ATCAGCCAAC TCTGGCAGAA GAGATGGGTG TTCTTCAGGA GCGTATTACC 800 

TCCACCAAGA CCGGTTCTAT CACCTCCGTA 830 

45 

2) INFORMATION FOR SEQ ID NO : 259 

(i) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 931 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chryseobacterium meningosepticum 

(B) STRAIN: CDC B7681 

50 

155 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259 

TTTCAGGAAG TAGAGGAACT ACCAAACATT TATGACGCAC TTGAAGTTGT 50 

CAGAGAAGGC CAGAAAGGTC TGATCTTAGA AGTTGAACAA CACATCGGTG 100 

5 AGGATACAGT GAGATGTATC GCTATGGATG CAACAGACGG ACTTCAAAGA 150 

GGACAATCTG TAATAGGACA TGGAAGACAA ATTACTATGC CTATTGGTGA 2 00 

AGAAGTAAAT GGTAGATTGT TCAACGTGGT AGGAGATGCT ATCGACGGAC 2 50 

TTCAGGAATT ATCTAAAGAC AATGGTTTGC CAATCCACAG AGAAGCACCT 300 

AAATTCGATC AGTTATCAAC TTCTGCTGAA GTACTATATA CAGGTATCAA 3 50 

10 AGTAATCGAC CTTATCGAGC CTTATGCAAA AGGTGGTAAA ATTGGTTTGT 4 00 

TCGGTGGTGC TGGTGTAGGT AAAACAGTAT TGATCCAGGA ATTGATTAAT 450 

AACATTGCTA AAGGACACGG TGGTCTTTCT GTATTCGCAG GAGTAGGTGA 500 

GAGAACAAGA GAAGGAAATG ACCTTCTTCG TGAGATGTTA GAGTCTGGTA 550 

TTATTAAATA TGGTGACGAA TTCATGCATT CTATGGAGAA CGGTGGATGG 600 

15 GATCTTTCTA AAGTTGACAG TGAGTTGATG AAAGAGTCTA AAGCTGCTTT 650 

CGTTTTCGGA CAGATGAACG AGCCACCAGG TGCAAGAGCA CGTGTAGCCC 700 

TTTCTGGTCT TACTTTAGCT GAATACTACC GTGATGGTGG CGAAAGCGGA 750 

CAAGGTAGAG ACGTTCTTTT CTTCGTAGAC AACATCTTCC GTTTTACACA 800 

GGCTGGTTCT GAGGTGTCTG CACTTCTAGG TCGTATGCCT TCAGCGGTAG 850 

20 GTTACCAACC AACTCTAGCT TCTGAGATGG GTGCAATGCA GGAGAGAATT 900 

ACTTCAACTA AAAACGGATC TATTAC AT CT G 931 



25 2) INFORMATION FOR SEQ ID NO:260 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 726 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY.: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ci trobacter amalonaticus 

(B) STRAIN: ATCC 25405 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260 



10 



GTATCGTACG TACCATCGCA ATGGGTTCTT CCGACGGTCT GCGTCGTGGT 50 

CTGGAAGTCA AAGACCTCGA GCACCCGATC GAAGTCCCGG TAGGTAAAGC 100 

AACCCTGGGT CGTATCATGA ACGTCCTGGG TCACCCGATC GACATGAAAG 150 

GCGATATCGG TGAAGAAGAG CGTTGGGCTA TCCACCGCGC AGCACCGTCC 200 

15 TATGAAGAGC TGTCCAGCTC TCAGGAACTG CTGGAAACCG GTATCAAAGT 250 

TATCGACCTG ATGTGTCCGT TCGCGAASGG CGGTAAAGTG GGTCTGTTCG 300 

GTGGTGCGGG TGTAGGTAAA ACCGTAAACA TGATGGAGCT TATCCGTAAC 350 

ATCGCGATCG AGCACTCCGG TTACTCCGTG TTTGCGGGCG TAGGTGAACG 4 00 

TACTCGTGAG GGTAACGACT TCTACCACGA AATGACCGAC TCCAACGTTC 450 

50 TGGATAAAGT ATCCCTGGTA TATGGCCAGA TGAACGAGCC GCCGGGAAAC 500 

CGTCTGCGCG TTGCACTGAC CGGTCTGACC ATGGCAGAGA AATTCCGTGA 550 

CGAAGGTCGT GACGTACTGC TGTTCGTCGA TAACATCTAT CGTTACACCC 600 

TGGCCGGTAC GGAAGTTTCC GCACTGCTGG GTCGTATGCC ATCAGCGGTA 650 

GGTTACCAGC CGACCCTGGC GGAAGAGATG GGTGTTCTGC AGGAACGTAT 700 

55 CACTTCTACC AAAACCGGTT CTATCA 726 



2) INFORMATION FOR SEQ ID NO: 261 

50 

156 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: CitroJbacter braakii 

(B) STRAIN: ATCC 43162 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261 

15 GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTTATGAATG GTAAAGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTACGTACCA 100 

TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTTAAAGAT 150 

CTCGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACTC TGGGTCGTAT 2 00 

CATGAACGTC CTGGGTCACC CGATCGACAT GAAAGGCGAT ATCGGTGAAG 250 

20 AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CTTCCTATGA AGAGCTGTCC 300 

AGCTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCT AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAACAC 450 

TCCGGTTACT CCGTGTTTGC GGGCGTGGGT GAACGTACTC GTGAGGGTAA 500 

25 CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAT AAAGTATCCC 550 

TGGTATATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCT 600 

CTGACCGGTC TGACCATGGC AGAGAAGTTC CGTGACGAAG GTCGTGACGT 650 

TCTGCTGTTC GTTGATAACA TCTATCGTTA CACCCTGGCC GGTACAGAAG 700 

TATCCGCTCT GCTGGGTCGT ATGCCATCAG CGGTAGGCTA CCAGCCGACC 750 

30 CTGGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT CTACCAAAAC 800 

CGGTTCTATC AC 812 



35 2) INFORMATION FOR SEQ ID NO: 262 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 811 bases 

(B) TYPE: Nucleic acid 
*0 (C) STRANDEDNESS: Double 

(D> TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

^5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter koseri 

(B) STRAIN: ATCC 27156 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262 



50 



GCGCGTGTAC GACGCCCTTG AGGTGCAGAA TGGTAATGAA CATCTGGTGC 50 

TGGAAGTTCA GCAGCAGCTC GGTGGCGGTA TCGTACGTAC CATCGCCATG 100 

GGTTCTTCCG ACGGCCTGCG TCGTGGTCTG GATGTGAAAG ACCTTGAGCA 150 

CCCGATCGAA GTCCCGGTAG GTAAAGCAAC GCTGGGTCGT ATCATGAACG 200 

55 TACTGGGCGA ACCAGTAGAC ATGAAAGGCG AGATCGGTGA AGAAGAGCGT 250 

TGGGCTATCC ACCGTGCGGC ACCGTCCTAC GAAGAGTTGT CAAACTCTCA 3 00 

GGAACTGCTG GAAACCGGTA TCAAAGTTAT CGACCTGATG TGCCCGTTCG 350 

CGAAGGGCGG TAAAGTGGGT CTGTTCGGTG GTGCGGGTGT AGGTAAAACC 4 00 

GTAAACATGA TGGAGCTGAT CCGTAACATC GCGATCGAAC ACTCCGGTTA 450 

50 CTCCGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT AACGACTTCT 500 
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ACCACGAAAT GACCGACTCC 
GGCCAGATGA ACGAGCCGCC 
CCTGACCATG GCGGAGAAAT 
TCGTCGACAA CATCTACCGT 
CTGCTGGGTC GTATGCCTTC 
AGAGATGGGT GTTTTGCAGG 
TCACCTCCGT A 



AACGTTATCG ACAAAGTATC CCTGGTTTAC 
GGGAAACCGT CTGCGCGTTG CGCTGACCGG 
TCCGTGACGA AGGTCGTGAC GTTCTGCTGT 
TACACCCTGG CCGGTACGGA AGTATCCGCA 
AGCGGTAGGT TACCAGCCGA CCCTGGCGGA 
AACGTATCAC CTCCACCAAA ACCGGTTCTA 



550 
600 
650 
700 
750 
800 
811 



2) INFORMATION FOR SEQ ID NO: 263 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: CitroJbacter farmeri 

(B) STRAIN: ATCC 51112 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263 

GATGCCGTAC CGCGCGTGTA CGATGCTCTT GAGGTTATGA ATGGTAAAGA 50 

GAGTCTGGTG CTGGAAGTTC AGCAGCAGCT CGGCGGCGGT ATCGTACGTA 100 

CCATCGCGAT GGGTTCTTCC GACGGTCTGC GTCGTGGTCT GGAAGTAAAA 150 

GACCTCGAAC ACCCGATCGA AGTCCCGGTA GGTAAAGCAA CTCTGGGTCG 2 00 

TATCATGAAC GTCCTGGGTC ACCCGATCGA CATGAAAGGC GATATCGGTG 250 

AAGAAGAGCG TTGGGCTATC CACCGCGCAG CGCCATCCTA TGAAGAGCTG 300 

TCCAGCTCTC AGGAACTGCT GGAAACCGGT ATCAAAGTTA TCGACCTGAT 350 

GTGTCCGTTT GCGAAGGGCG GTAAAGTCGG TCTGTTCGGT GGTGCGGGTG 400 

TAGGTAAAAC CGTAAACATG ATGGAACTCA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ACTCCGTGTT TGCGGGCGTA GGTGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGACCGATTC CAACGTTCTG GATAAAGTAT 550 

CCCTGGTATA TGGCCAGATG AACGAGCCGC CGGGAAACCG TCTGCGCGTT 600 

GCGCTGACCG GTCTGACCAT GGCAGAGAAA TTCCGTGACG AAGGTCGTGA 650 

CGTACTGCTG TTCGTCGATA ACATCTATCG TTACACCCTG GCCGGTACGG 700 

AAGTATCCGC ACTGCTGGGC CGTATGCCAT CAGCGGTAGG CTACCAGCCA 750 

ACCCTGGCGG AAGAGATGGG TGTTCTGCAG GAACGTATCA CTTCTACCAA 800 

AACCGGTTCT ATTACC 816 



2) INFORMATION FOR SEQ ID NO: 264 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter freundii 

(B) STRAIN: ATCC 8090 
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WO 01/23604 PCT/CAOO/01150 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264 

AGGATGCCGT ACCGCGCGTG TACGATGCTC TTGAGGTTAT GAATGGTAAA 50 

GAGAGCCTGG TGCTGGAAGT TCAGCAGCAG CTCGGCGGCG GTATCGTACG 100 

5 TACCATCGCC ATGGGTTCTT CTGACGGTCT GCGTCGTGGT CTGGAAGTTA 150 

AAGACCTCGA GCACCCGATC GAAGTCCCGG TAGGTAAAGC AACGCTGGGT 200 

CGTATCATGA ACGTTCTGGG TCACCCGATC GACATGAAAG GCGATATCGG 250 

TGAAGAAGAG CGTTGGGCTA TCCACCGTGC AGCACCTTCC TACGAAGAGC 3 00 

TGTCAAGCTC TCAGGAACTG CTGGAAACCG GTATCAAAGT TATCGACCTG 350 

10 ATGTGTCCGT TCGCTAAGGG CGGTAAAGTT GGTCTGTTCG GTGGTGCGGG 400 

TGTAGGTAAA ACCGTAAACA TGATGGAGCT GATCCGTAAC ATCGCGATCG 450 

AACACTCCGG TTACTCCGTG TTTGCGGGCG TAGGTGAACG TACTCGTGAG 500 

GGTAACGACT TCTACCACGA AATGACCGAC TCCAACGTTC TGGACAAAGT 550 

ATCCCTGGTA TATGGCCAGA TGAACGAGCC GCCTGGAAAC CGTCTGCGTG 600 

15 TTGCGCTGAC CGGTCTGACC ATGGCTGAGA AGTTCCGTGA CGAAGGTCGT 650 

GACGTTCTGC TGTTCGTTGA TAACATCTAT CGTTACACCC TGGCCGGTAC 700 

AGAAGTATCT GCACTGCTGG GTCGTATGCC ATCAGCGGTA GGCTACCAGC 750 

CGACCCTGGC GGAAGAGATG GGTGTTCTGC AGGAACGTAT CACCTCCACC 800 

AAAACCGGTT CTATCACCT 819 

20 



2) INFORMATION FOR SEQ ID NO: 265 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) ' STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi ) ORIGINAL SOURCE : 

(A) ORGANISM: Ci trobacter koseri 
35 (B) STRAIN: ATCC 27028 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265 

GATGCCGTAC CGCGCGTGTA CGACGCCCTT GAGGTGCAGA ATGGTAATGA 50 

40 ACATCTGGTG CTGGAAGTTC AGCAGCAGCT CGGTGGCGGT ATCGTACGTA 100 

CCATCGCCAT GGGTTCTTCC GACGGCCTGC GTCGTGGTCT GGATGTGAAA 150 

GACCTTGAGC ACCCGATCGA AGTCCCGGTA GGTAAAGCAA CGCTGGGTCG 200 

TATCATGAAC GTACTGGGCG AACCAGTAGA CATGAAAGGC GAGATCGGTG 250 

AAGAAGAGCG TTGGGCTATC CACCGTGCGG CACCGTCCTA CGAAGAGTTG 300 

45 TCAAACTCTC AGGAACTGTT GGAAACCGGT ATCAAAGTTA TCGACCTGAT 350 

GTGTCCGTTC GCGAAGGGCG GTAAAGTGGG TCTGTTCGGT GGTGCGGGTG 400 

TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAA 450 

CACTCCGGTT ACTCCGTGTT TGCGGGCGTA GGTGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGACCGACTC CAACGTTATC GACAAAGTAT 550 

50 CCTTGGTTTA CGGCCAGATG AACGAGCCGC CGGGAAACCG TCTGCGCGTT 600 

GCGCTGACCG GCCTGACCAT GGCGGAGAAA TTCCGTGACG AAGGTCGTGA 650 

CGTTCTGCTG TTCGTCGACA ACATCTACCG TTACACCCTG GCCGGTACGG 700 

AAGTATCCGC ACTGCTGGGT CGTATGCCTT CAGCGGTAGG TTACCAGCCG 750 

ACCCTGGCGG AAGAGATGGG TGTTTTGCAG GAACGTATCA CCTCCACCAA 800 

55 AACCGGTTCT ATCACCTCCG TA 822 



2) INFORMATION FOR SEQ ID NO: 266 

60 

159 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CAOO/01150 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

( i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Ci troJbacter sedlakii 

(B) STRAIN: ATCC 51115 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266 

15 GATGCCGTAC CGCGCGTGTA CGACGCCCTT GAGGTACAGA ATGGTAATGA 50 

GCGTCTGGTG CTGGAAGTTC AGCAGCAGCT CGGTGGCGGT ATTGTACGTA 100 

CCATCGCCAT GGGTTCTTCC GACGGTCTGC GTCGTGGTCT GGAAGTAAAA 150 

GACCTTGAGC ACCCGATCGA AGTCCCGGTA GGTAAAGCAA CGCTGGGTCG 200 

TATCATGAAC GTACTGGGCG AACCAGTAGA CATGAAAGGC GACATCGGTG 250 

20 AAGAAGAGCG TTGGGCTATC CACCGTGCCG CGCCGTCCTA TGAAGAGTTG 300 

TCTAACTCTC AGGAACTGCT GGAAACCGGC ATCAAAGTTA TCGACCTGAT 350 

GTGTCCGTTC GCGAAGGGCG GTAAAGT CGG TCTGTTCGGT GGTGCGGGCG 400 

TAGGTAAAAC CGTAAACATG ATGGAGC TGA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ACTCTGTGTT TGCGGGCGTG GGTGAACGTA CTCGTGAGGG 500 

25 TAACGACTTC TACCACGAAA TGACCGACTC CAACGTTATC GACAAAGTAT 550 

CCCTGGTGTA CGGCCAGATG AACGAGCCGC CTGGAAACCG TCTGCGCGTC 600 

GCACTGACCG GTCTGACCAT GGCTGAGAAG TTCCGTGACG AAGGTCGTGA 650 

CGTTCTGCTG TTCGTCGATA ACATCTATCG TTACACCCTG GCCGGTACGG 700 

AAGTATCCGC ACTGCTGGGT CGTATGCCTT CAGCGGTAGG TTATCAGCCG 750 

3 0 ACTCTGGCGG AAGAGATGGG TGTTCTGCAG GAACGTATCA CCTCAACCAA 800 

AACCGGTTCT ATCACCTCCG 820 



35 2) INFORMATION FOR SEQ ID NO: 267 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 06 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter werkmanii 

(B) STRAIN: ATCC 51114 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267 



GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTTATGAATG GTAAAGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTACGTACCA 100 

TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTTAAAGAC 150 

CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACCC TGGGTCGTAT 200 

55 CATGAACGTC CTGGGTCATC CGATCGACAT GAAAGGCGAT ATCGGTGAAG 250 

AAGAGCGTTG GGCTATCCAC CGCGCAGCAC CTACCTATGA AGAACTGTCC 300 

AGTTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCG AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAACAC 450 

50 TCCGGTTACT CAGTGTTTGC GGGCGTTGGT GAACGTACTC GTGAGGGTAA 500 

160 
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CGACTTCTAC CACGAAATGA CCGACTCCAA 
TGGTATATGG CCAGATGAAC GAGCCGCCGG 
CTGACCGGTC TGACCATGGC TGAGAAGTTC 
TCTGCTGTTC GTTGATAACA TCTATCGTTA 
5 TATCTGCACT GCTGGGTCGT ATGCCATCAG 
CTGGCGGAAG AGATGGGTGT TCTGCAGGAA 
CGGTTC 



PCJ/CA00/01150 

CGTTCTGGAC AAAGTATCCC 550 

GAAACCGTCT GCGCGTTGCG 600 

CGTGACGAAG GTCGTGACGT 650 

CACCCTGGCC GGTACTGAAG 700 

CGGTAGGCTA CCAGCCAACC 750 

CGTATCACCT CTACCAAAAC 8 00 

806 



10 



2) INFORMATION FOR SEQ ID NO: 268 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 810 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



( i i ) MOLECULE TYPE : Genomi c DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter youngae 

(B) STRAIN: ATCC 29935 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268 

GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTTATGAATG GTAAAGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTACGTACCA 100 

TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTTAAAGAC 150 

30 CTCGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACGC TGGGTCGTAT 2 00 

CATGAACGTT CTGGGTCACC CGATCGACAT GAAAGGCGAT ATCGGAGAAG 2 50 

AAGAGCGTTG GGCTATTCAC CGCGCAGCAC CTTCCTATGA AGAGCTGTCC .3 00 

AGCTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCT AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 

35 GTAAAACCGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CTGTGTTTGC GGGTGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGATTCCAA CGTTCTGGAT AAAGTATCCC 550 

TGGTTTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGTC TGACCATGGC TGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

10 ACTGCTGTTC GTCGATAACA TCTATCGTTA TACCCTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGTCGT ATGCCATCAG CGGTAGGTTA CCAGCCGACC 750 

CTGGCGGAAG AGATGGGTGT TCTGCAGGAA CGTATCACTT CTACCAAAAC 800 

CGGTTCTATC 810 

15 

2) INFORMATION FOR SEQ ID NO: 269 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium innocuum 

(B) STRAIN: ATCC 14501 



50 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269 



PCT/CAOO/01150 



TTGAGAACGG AGATTTGCCG CAGCTATTGA CCGCTATTGA AATTCCTCTG 50 

AAAGACAGTG AATCTCTGAT TGTCGAAGTT GCTCAGCATA TCGGTGATGA 100 

5 ACGTGTCCGC TGTATCGCTA TGGGCGGTAC AGATGGTCTG GTTCGTGGAA 150 

TGGAAGCCAT TGATACAGGA TCCGCAATCC GTGTACCGGT GGGAAAAGAA 200 

ATTCTGGGAA GAATGTTCAA TGTCCTCGGA CGTGAAATTG ATGGTCTGGG 250 

ACCTGTAGGA ACGGATAACA CACTGCCGAT CCACAGACAG GCACCGGGCT 300 

TTGAGGAGCA GCAGACATCC GCAGAAATGC TGGAAACAGG AATTAAGGTC 350 

10 ATTGACCTGT TATGTCCATA TTCCAAGGGT GGTAAGATTG GTTTGTTTGG 400 

TGGTGCGGGA GTAGGTAAAA CCGTACTGAT TCAGGAGCTG ATTCATAATA 450 

TCGCCAAGGA ACATGGTGGA ATGTCCGTCG TTACCGGTGT AGGGGAGAGA 500 

ACCCGTGAAG GAAAC G AC AT GTATCATGAA ATGAAGGACA GCGGTGTCCT 550 

TGATAAGACC GTACTGGTTT ACGGACAGAT GAATGAATCA CCGGGTGCCA 600 

15 GAATGCGTGT CGGTCTGACC GGGCTGACGA TGGCGGAATA TTTCCGTGAT 650 

CACGACCATC AGGATGTATT GCTGTTTATT GATAATATTT TCCGTTTTAC 700 

CCAGGCGGGA AGTGAAGTAA GTGCCCTGCT GGGACGTATG CCAAGTGCAG 750 

TAGGCTATCA GCCGACACTT GCGACAGAAA TGGGACAGCT GCAGGAGCGC 800 
ATTACATCCA CGAAGGATGG TTCCATT . 827 

20 



2) INFORMATION FOR SEQ ID NO: 270 



25 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: .Clostridium perfringens 
35 (B) STRAIN: ATCC 13124 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270 

GCAAATGATG AACTTCCTAA TATATTTAAT GCAATACACA TAAAAATGGA 50 

40 CGATGGAAAA ATCTTAGTTT GTGAGGTAGA GCAACACGTA GGAGACGATA 100 

TAGTTAGAAC TATAGCTATG GAAGCTACTG AAGGAC TAAG AAGAGGTGTA 150 

GAAGCTGTTG ATACAGGAGC ACCTATATCA GTACCAGTTG GTGAATGCGT 200 

ATTAGGAAGA ATATTTAACG TATTAGGTAA ACCACTAGAT AGTGGAGCTG 250 

AAGTTAATAA CGAAGAAAAA TATCCAATTC ATAGACCAGC TCCATCATTT 300 

15 GAAGAACAAT CAGTTGTTCC TCAAATGTTT GAGACAGGAA TAAAGGTTAT 350 

CGACCTTTTA GCACCTTACC AAAGAGGGGG AAAAATCGGT CTATTTGGAG 400 

GTGCAGGTGT TGGTAAAACA GTTCTTATCC AAGAGCTTAT AAACAACATA 450 

GCTAAAGAGC ACGGTGGACT TTCTGTATTC ACAGGAGTTG GAGAAAGATC 500 

AAGAGAAGGT AATGACCTTT ACTATGAAAT GATGGAATCA GGAGTTATAA 550 

50 AAAATACAGC ATTAGTATTT GGACAAATGA ACGAACCACC TGGAGCAAGA 600 

ATGAGAGTTG CTTTAACAGG ACTTACTATG GCTGAGTACT TCAGAGACCA 650 

AGGTCAAGAC GTGTTATTAT TCATAGATAA CATATTCAGA TTCTCACAAG 700 

CTGGATCAGA GGTTTCAGCT TTATTAGGAA GAATACCATC AGCTGTTGGT 750 

TACCAACCAA CTCTTGCTAC AGAGATGGGA GCTCTTCAAG AGAGAATCAC 800 

55 ATCAACTACC CATGGATCAA TTACATCAG 829 



2) INFORMATION FOR SEQ ID NO: 271 
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( i ) SEQUENCE CHARACTER I STIC S : 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271 
TTGTCCACGT TGGATRTCTT CA 



2) INFORMATION FOR SEQ ID NO: 272 

( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium diphtheriae 

(B) STRAIN: ATCC 27010 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272 

CCCGCGTGGC GAGGTGCCGG CACTGTACAA CGCGTTGACT GTCGAGGTCA 50 

CCCTCGAGGC AGTCGCTAAG ACCATTACCC TTGAGGTTGC CCAGCACTTG 100 

GGCGACAACC TCGTTCGCGC CGTGTCCATG GCCCCTACCG ACGGCCTCGT 150 

CCGTGGTGCT GTTGTGACCG ACTCGGGCAA GCCAATCTCC GTGCCAGTTG 200 

GCGACGTTGT TAAAGGCCAC GTTTTCAACG CACTGGGCGA TTGCTTGGAT 2 50 

GAGCCAGGTC TCGGCCGCGA TGGTGAGCAG TGGGGAATTC ACCGCGATCC 300 

ACCACCATTC GATCAGCTCG AAGGTAAGAC CGAAATCCTC GAGACCGGTA 350 

TTAAGGTCAT CGACTTGCTC ACCCCTTACG TTAAGGGCGG CAAGATTGGT 400 

CTGTTCGGTG GTGCAGGTGT GGGTAAGACC GTGCTCATCC AGGAGATGAT 450 

CACTCGTATT GCTCGCGAGT TCTCCGGTAC CTCCGTCTTC GCTGGCGTTG 500 

GTGAGCGTAC CCGTGAGGGC ACCGACCTCT TCCTCGAAAT GGAAGAAATG 550 

GGCGTTCTTC AGGACACCGC TCTCGTGTTC GGCCAGATGG ACGAGCCACC 600 

AGGAGTCCGT ATGCGCGTTG CTCTGTCCGG TCTGACCATG GCGGAGTACT 650 

TCCGCGATGT TCAGCACCAG GACGTGCTTC TGTTCATCGA TAACATTTTC 700 

CGTTTCACCC AGGCCGGTTC CGAGGTTTCG ACCCTTCTTG GTCGTATGCC 750 

TTCCGCCGTG GGTTACCAGC CAACCTTGGC TGACGAGATG GGTGTTCTCC 800 

AGGAGCGTAT TACCTCTA 818 



2) INFORMATION FOR SEQ ID NO: 273 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium pseudodiphtheriticum 

(B) STRAIN: ATCC 10700 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273 

CGATGCCTGC TCTGTACAAC GCGCTGACTG TCGAGGTCAC CCTCGAGGCA 50 

GTCGCCAAGA CCATCACGCT TGAGGTTGCA CAGCACCTCG GCGATAACCT 100 

GATCCGGACC ATTGCGTTGG CACCTACGGA CGGTCTCGTC CGTGGCGCTG 150 

10 AGGTTATCGA CACTGGTAAG CCAATTACTG TTCCCGTCGG CGATGCCGTC 200 

AAAGGACACG TCTTCAATGC GCTCGGTGAG TGTTTGGACG AACCAGGATT 250 

GGGCCGCGAC GGCGAACAGT GGGGAATCCA CCGCGATCCG CCACCATTCG 300 

ATGCGCTGGA GGGCAAAACC GAGATTCTGG AGACTGGAAT CAAGGTTATC 350 

GACCTCCTTA CCCCTTACGT TAAGGGTGGC AAAATTGGTC TGTTCGGTGG 400 

15 CGCCGGCGTC GGCAAGACCG TTCTTATCCA GGAAATGATC ACTCGTATCG 450 

CTCGTAACTT CTCCGGTACT TCCGTGTTCG CCGGCGTCGG TGAGCGTACC 500 

CGTGAGGGTA CTGACCTGTT CCTGGAAATG GAAGAGATGG GCGTGTTGCA 550 

AGACACCGCC CTTGTCTTCG GTCAAATGGA CGAACCACCA GGGGTTCGTA 600 

TGCGCGTGGC CTTGTCTGGT CTAACCATGG CTGAATATTT CCGCGACGTT 650 

20 CAAAACCAGG ACGTTTTGTT GTTCATTGAC AACATCTTCC GTTTTACTCA 700 

GGCAGGTTCC GAGGTTTCCA CGCTGTTGGG CCGTATGCCT TCCGCCGTGG 750 

GTTATCAGCC AACATTGGCT GATGAGATGG GTGTTTTGCA GGAACGGATT 800 

ACCTCTACAC GTGGTAAGTC AATTACTTCC CTG 833 

25 

2) INFORMATION FOR SEQ ID NO: 274 

(i) SEQUENCE CHARACTERISTICS : 
30 (A)'" LENGTH: 417 bases 

(B) T TYPE : Nucleic acid 

(C) STRANDEDNESS : Double 

(D) 'TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium ulcerans 

(B) STRAIN: NCTC 8665 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274 

CCGTGCTCAT CCAGGAGATG ATCACCCGTG TGGCCCGCAA CTTCGGCGGC 50 

ACCTCTGTGT TCGCCGGCGT CGGCGAGCGC ACCCGTGAGG GCAACGACCT 100 

15 CTGGGTCGAG ATGGACGAGG CCGACGTGCT CAAGGACACC GCCCTGGTGT 150 

TCGGCCAGAT GGACGAGCCG CCGGGAACCC GTCTGCGCGT GGCCCTGTCC 2 00 

GCGCTGACCA TGGCGGAGTA CTTCCGCGAT GTGCAGAACC AGGACGTGCT 2 50 

GCTGTTCATC GACAACATCT TCCGCTTCTC CCAGGCCGGC TCCGAGGTCT 3 00 

CCACCCTGCT GGGCCGCATG CCCTCCGCGG TGGGCTACCA GCCGAACCTG 3 50 

50 GCGGACGAGA TGGGTGTGCT GCAGGAGCGC ATCACCTCGA CTCGCGGCCA 4 00 

CTCCATCACC TCGATGC 417 



55 2) INFORMATION FOR SEQ ID NO: 275 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS : Double 
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(ii) MOLECULE TYPE: Genomic DNA 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium urealyticum 

(B) STRAIN: ATCC 43042 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275 

L0 

GGGCAGCAGC CAGCACTATT CAACGCGCTG CACGTCGAGG TTGACCTCGA 50 

GGCAGTTGCG AAGACCATTA CCCTGGAGGT CGCACAGCAC CTGGGTGACA 100 

ACCTGGTGCG CACCGTCTCC ATGGCCCCGA CCGACGGCCT GGTCCGCGGT 150 

GCAGAGGTCA AGGACACCGG TAAGCCGATC TCTGTGCCAG TCGGCGATGT 2 00 

15 TGTCAAGGGG CACGTCTTCA ACGCCCTGGG CGACTGCCTG GATGAGCCAG 250 

GTCTCGGCCG CGACGGCGAG CAGTGGGGCA TCCACCGCGA GCCACCGGCA 300 

TTCGACGAGC TCGAGGGTAA GACCGAGATC CTGGAGACCG GCGTTAAGGT 350 

CATCGACCTG CTGACCCCTT ACGTCAAGGG CGGCAAGATT GGCCTCTTCG 400 

GTGGTGCAGG TGTGGGTAAG ACCGTCCTGA TTCAGGAGAT GATTACCCGT 4 50 

20 ATCGCCCGCG AGTTCTCCGG TACCTCCGTG TTCGCCGGCG TCGGCGAGCG 500 

TACCCGTGAG GGTACGGACC TCTTCCTCGA GATGGAGGAG ATGGGCGTGC 550 

TCCAGGACAC CGCGCTGGTG TTCGGTCAGA TGGATGAGCC GCCGGGAGTC 600 

CGTATGCGCG TGGCTCTGTC CGGTCTGACC ATGGCGGAGT ACTTCCGCGA 650 

TGTTCAGGGC CAGGACGTGC TGCTGTTCAT CGACAACATC TTCCGTTTCA 700 

25 CCCAGGCAGG TTCTGAGGTC TCCACGCTGC TCGGCCGCAT GCCGTCCGCA 750 

GTGGGTTACC AGCCGACCCT GGCTGACGAG ATGGGTGTTC TGCAGGAGCG 800 

CATTACCTCC ACGAAGGGTA AGTC CATTAC CTCCC 835 



30 



10 



2) INFORMATION FOR SEQ* ID NO: 276 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 818 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coxiella burnetii 

(B) STRAIN: Nine Mile phase II 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276 

CCCGTCACGC TGTCCCGAAA GTGTACGATG CTTTGCAGGT GGATGAAAAT 50 

AATTTAACGC TCGAAGTCCA ACAGCAACTC GGGGACGGTG TCGTGCGCAC 100 

AATTGCCATG GGCAGCACTG AGGGCTTAAA ACGCGATATC GCCGTAAAAA 150 

50 ATACGGAAAA ACCGATTGAA GTTCCCGTAG GAAAAGAAAC TTTAGGTCGT 2 00 

ATCATGAACG TGCTGGGTGA GCCGATCGAT GAGTTAGGTC CCATTAATTC 250 

AAAAGAAAAA CTCCCTATTC ATCGTCCTGC GCCGAGCTTT ATTGAGCAAT 3 00 

CTGGCGCTAC CGAATTATTA GAAACCGGTA TTAAAGTGGT CGATTTGCTT 3 50 

TGCCCCTTTG CTAAGGGAGG CAAAGTGGGT CTTTTTGGAG GCGCGGGCGT 4 00 

55 TGGAAAAACG GTTAATATGA TGGAATTAAT CCGTAATATC GCCATTGAAC 450 

ACAGCGGTTA TTCTGTTTTT GCGGGTGTGG GAGAAAGAAC GCGAGAAGGC 500 

AATGATTTTT ATCATGAAAT GAAAGAATCC AATGTCTTGG ATAAAGTGGC 550 

GTTGGTGTAC GGACAAATGA ACGAGCCGCC AGGGAACCGC TTGCGGGTGG 600 

GTTTGACGGG GCTTACGCTG GCGGAAGCCT TCCGTGACGA AGGACGCGAC 650 

50 GTTCTGTTAT TTATCGATAA TATCTTTCGT TACACTTTGG CAGGGGTTGA 700 
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AGTCTCTGCC CTCCTCGGTC GGATGCCATC GGCTGTGGGT TATCAGCCGA 750 
CGTTGGCCGA AGAGATGGGG GCCCTGCAAG AACGCATTAC TTCCACTAAA 800 
AAAGGGTCCA TTACGTCG 818 



2) INFORMATION FOR SEQ ID NO: 277 

(i) SEQUENCE CHARACTERISTICS : 
10 (A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Edwardsiella hoshinae 

(B) STRAIN: ATCC 33379 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277 



TTCCCGCAGG ACGCCGTGCC GAAGGTGTAT AACGCACTGG AAGTAAAAGG 50 

CGGTGCCACG AAACTGGTAC TGGAAGTGCA GCAGCAGCTG GGTGGCGGCG 100 

25 TAGTTCGCTG CATCGCGATG GGCTCTTCCG ACGGTCTGCG CCGTGGGCTA 150 

GAGGTTGAAG ACCAAGACCA TCCGATCGAG GTTCCTGTTG GCAAGGCGAC 2 00 

TCTGGGCCGT ATCATGAACG TACTGGGTGA TCCGGTCGAC ATGAAGGGCG 2 50 

AGATCGGTGA AGAAGAGCGT TGGGCTATCC ATCGTGCTGC ACCGAGCTAT 3 00 

GAAGATCTGT CTAACTCTCA GGAACTGCTG GAGACCGGCA TTAAGGTTAT 3 50 

3 0 CGACCTGATT TGCCCGTTCG CTAAAGGCGG TAAAGTGGGC CTGTTCGGTG 4 00 

GGGCCGGTGT GGGTAAGACC GTTAACATGA TGGAGCTTAT CCGTAACATC 450 

GCTATCGAGC ACTCCGGTTA CTCAGTCTTC GCCGGTGTGG GTGAGCGTAC 500 

CCGTGAGGGT AACGACTTCT ACCACGAGAT GACCGATTCC AACGTATTGG 550 

ATAAAGTTTC TCTGGTGTAT GGTCAGATGA ACGAGCCACC GGGAAACCGT 600 

35 CTGCGCGTGG CGCTGACCGG TCTGACCATG GCGGAGAAAT TCCGTGATGA 650 

AGGTCGTGAT GTACTGTTGT TCATCGATAA CATCTACCGT TATACCTTGG 700 

CCGGTACTGA AGTCTCCGCT CTGCTGGGCC GTATGCCGTC GGCGGTAGGT 750 

TATCAGCCGA CTCTGGCGGA GGAAATGGGG GTGCTGCAAG AGCGTATTAC 800 

CTCCACTAAG ACCGGGTCCA TCACCTCTG 829 

40 



2) INFORMATION FOR SEQ ID NO: 278 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 809 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Edwardsiella tarda 
55 (B) STRAIN: ATCC 15947 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278 

GCCGTGCCGA AGGTGTATAA CGCACTGGAA GTAAAAGGCG GTGCCACGAA 50 
50 ACTGGTACTG GAAGTGCAGC AGCAGCTGGG TGGCGGCGTC GTTCGCTGCA 100 
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TCGCGATGGG CTCCTCCGAC GGTCTGCGCC GTGGGCTGGT TSGTTGKAGKCr ' rfsfe 

CAAGACCATC CGATCGAGGT TCCGGTCGGT AAGGCGACCC TGGGCCGTAT 2 00 

CATGAACGTA CTGGGTGATC CGGTCGACAT GAAGGGCGAG ATCGGCGAAG 2 50 

AAGAGCGTTG GGCTATCCAC CGCGCGGCGC CGAGCTATGA AGATCTGTCC 300 

5 AACTCTCAGG AGCTGCTGGA GACCGGCATC AAGGTTATCG ACCTGATTTG 350 

CCCGTTCGCC AAAGGCGGTA AAGTTGGCCT GTTCGGTGGT GCCGGTGTGG 400 

GTAAGACCGT TAACATGATG GAGCTTATCC GTAACATCGC TATCGAGCAC 450 

TCCGGTTACT CCGTATTTGC CGGTGTAGGC GAGCGTACCC GTGAGGGTAA 500 

CGACTTCTAC CACGAGATGA CCGACTCCAA CGTATTGGAT AAAGTTTCTC 550 

10 TGGTATACGG CCAGATGAAT GAGCCGCCGG GAAACCGTCT GCGTGTGGCG 600 

TTGACCGGTC TGACCATGGC GGAGAAATTC CGTGATGAAG GTCGCGATGT 650 

GTTGTTGTTC ATCGATAACA TTTATCGTTA TACCTTGGCT GGTACCGAAG 700 

TTTCTGCTCT GCTGGGTCGT ATGCCGTCGG CGGTAGGTTA TCAGCCGACC 750 

CTGGCGGAAG AGATGGGTGT GTTGCAAGAG CGTATCACCT CAACGAAGAC 800 

15 GGGCTCTAT 809 



20 



2) INFORMATION FOR SEQ ID NO: 279 



( i ) SEQUENCE CHARACTERI ST I C S : 

(A) LENGTH: 84 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Eikenella corrodens 

•(B) STRAIN: ATCC 23834. 

(xi)SEQUENCE DESCRIPTION: SEQ ID NO: 279 

35 TTTCCGCGTA CCGCCATTCC GCGTGTTTAC GATGCACTCA AACTGGTTGA 50 

TACTGATTTG ACGCTGGAAG TACAGCAGCA GCTTGGTGAC GGCGTTGTCC 100 

GTACCATTGC GATGGGTAGT ACAGACGGTT TGAAACGTGG CTTGGCTGTG 150 

CAAAATACTG GTGCACCGAT TACTGTGCCG GTGGGGAAAG CCACCCTAGG 200 

CCGCATCATG GACGTTCTGG GCAACCCCGT GGACGAACAA GGTCCGATCG 250 

40 GTTCTGACCA AACCCGTGCT ATTCATCAAT TTGCACCTAA GTTCGACGAA 3 00 

CTCTCCAGCA CTACCGAATT GTTGGAAACA GGCATTAAAG TGATCGATTT 350 

GCTTTGTCCG TTTGCTAAAG GTGGTAAAGT GGGTCTGTTT GGCGGTGCCG 400 

GTGTGGGCAA AACCGTGAAC ATGATGGAGC TGATTAACAA CATTGCCAAA 450 

GCGCACAGTG GTCTTTCCGT ATTCGCCGGT GTGGGTGAGC GTACTCGTGA 500 

45 AGGTAACGAC TTCTACCACG AAATGAAAGA CTCCAACGTG TTGGATAAAG 550 

TGGCAATGGT GTATGGCCAG ATGAACGAGC CGCCTGGTAA CCGCTTGCGT 600 

GTTGCTCTAA CTGGTTTGTC GATGGCTGAA TACTTCCGTG ACGAAAAAGA 650 

CGAAAACGGC AAAGGCCGTG ACGTATTATT CTTTGTGGAT AATATCTACC 700 

GCTATACTCT GGCGGGTACC GAAGTGTCGG CTCTGCTTGG CCGTATGCCT 750 

50 TCTGCTGTGG GTTATCAGCC AACTTTGGCT GAAGAAATGG GTCGTTTGCA 800 

GGAGCGTATT ACCTCCACCC AAACTGGTTC CATTACCTCT 840 



55 2) INFORMATION FOR SEQ ID NO: 280 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 



PCT/CA00/01150 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: En terobacter agglomerans 

(B) STRAIN: ATCC 27989 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280 



L0 



GCCGTACCAC GAGTGTACGA TGCACTTGAG GTAAAGAATG GTGAAGAGCG 50 

TCTGGTGCTG GAAGTTCAGC AACAGCTCGG CGGTGGCGTT GTACGTACCA 100 

TCGCAATGGG TTCTTCTGAT GGTCTGCGTC GTGGTCTGGA AGTAACGGAC 150 

CTGGCTCACC CGATCGAAGT CCCGGTAGGT AAAGCAACAC TGGGTCGTAT 200 

15 CATGAACGTA CTGGGCGAAC CAGTAGACAT GAAAGGCGAC ATCGGTGAAG 250 

AAGAGCGTTG GGCGATCCAC CGTGCAGCAC CGTCCTACGA AGAGTTGTCA 300 

AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCT AAGGGCGGTA AAGTGGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 

GTAAAACCGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 450 

20 TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAC AAAGTATCCC 550 

TGGTGTATGG TCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGTC TGACCATGGC TGAGAAGTTC CGTGACGAAG GTCGTGACGT 650 

ACTGTTGTTC GTTGACAACA TCTACCGTTA CACCCTGGCC GGTACGGAAG 700 

25 TATCCGCACT GCTGGGCCGT ATGCCTTCTG CGGTAGGTTA TCAGCCGACG 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAAAC 800 

CGG 803 



30 



2 ) INFORMATION FOR SEQ ID NO: 281 



' (i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 833 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



10 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: En terobacter amnigenus 

(B) STRAIN: ATCC 33072 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281 

CGAATTCCCT CAGGATGCCG TACCGCGCGT GTACGATGCT CTTGAGGTAC 50 

AGAATGGTAA CGAGAGTCTG GTGCTGGAAG TTCAGCAGCA GCTCGGTGGT 100 

GGTATCGTAC GTACTATCGC CATGGGTTCT TCCGACGGTC TGCGTCGTGG 150 

50 TCTGGCTGTT AAAGATCTCG AACACCCGAT CGAAGTCCCG GTAGGTAAAG 200 

CAACACTGGG TCGTATCATG AACGTTTTGG GTCAACCAAT CGACATGAAA 250 
GGCGACATCG GTGAAGAAGA CCGTTGGGCA ATCCACCGTG CAGCACCTTC * 300 

CTATGAAGAG CTGTCTAGCT CTCAGGAACT GCTGGAAACC GGCATCAAAG 3 50 

TTATCGACCT GATGTGTCCG TTCGCTAAGG GCGGTAAAGT TGGTCTGTTC 400 

55 GGCGGTGCGG GCGTGGGTAA AACTGTAAAC ATGATGGAGC TGATCCGTAA 4 50 

CATCGCGATC GAGCACTCCG GTTACTCCGT GTTTGCAGGC GTGGGTGAGC 500 

GTACTCGTGA GGGTAACGAC TTCTACCACG AAATGACCGA TTCCAACGTT 550 

CTGGATAAAG TATCCCTGGT TTATGGCCAG ATGAACGAGC CACCAGGAAA 600 

CCGTCTGCGC GTTGCGCTGA CCGGTCTGAC TATGGCTGAG AAGTTCCGTG 650 

50 ACGAAGGTCG TGACGTACTG CTGTTCGTAG ATAACATCTA CCGTTACACC 700 
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CTGGCCGGTA CTGAAGTATC TGCGCTGCTG 
AGGTTACCAG CCGACCCTGG CGGAAGAGAT 
TCACTTCTAC CAAAACCGGT TCTATCACCT 



PCT/CA00/01150 

GGCCGTATGC CTTCAGCGGT 750 
GGGCGTTCTG CAGGAACGTA 800 
CCG 833 



2) INFORMATION FOR SEQ ID NO: 282 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter asburiae 

(B) STRAIN: ATCC 35953 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282 

GCCGTACCAC GCGTGTACGA CGCGCTTGAG GTACAGAATG GTAACGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTGCGTACCA 100 

TCGCGATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTCAAAGAC 150 

CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACAC TGGGTCGTAT 200 

CATGAACGTA TTGGGTCAAC CAATCGACAT GAAAGGCGAC ATCGGTGAAG 250 

AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CTTCCTACGA AGAGCTGTCC 300 

AGCTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTTG 400 

GTAAAACCGT AAACATGATG ' GAGCTGATCC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CCGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA ; CCGACTCCAA CGTTCTGGAC AAAGTATCCC 550 

TGGTTTACGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGTC TGACGATGGC TGAGAAGTTC CGTGATGAAG GCCGTGACGT 650 

TCTGCTGTTC GTTGATAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 700 

TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA CCAGCCTACG 750 

CTGGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT CTACCAAAAC 800 

CGGTTCTATC 810 



2) INFORMATION FOR SEQ ID NO: 283 

(i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 811 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter cancerogenus 

(B) STRAIN: ATCC 35317 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 83 

GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTACAGAATG GTAACGAGAG 50 
CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTACGTACTA 100 
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TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTAAAAGAC 150 

CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACAC TGGGTCGTAT 200 

CATGAACGTA TTGGGTCAAC CAATCGACAT GAAAGGCGAC ATCGGTGAAG 250 

AAGAGCGTTG GGCTATCCAC CGCGCAGCAC CTTCCTACGA AGAGCTGTCC 300 

5 AGCTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 4 50 

TCCGGTTATT CCGTGTTTGC GGGCGTGGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAT AAAGTATCCC 550 

10 TGGTGTACGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGCC TGACCATGGC TGAGAAGTTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC GTTGATAACA TCTACCGTTA CACCCTGGCC GGTACCGAAG 700 

TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA TCAGCCTACG 750 

CTGGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT CAACCAAAAC 800 

15 CGGTTCTATC A 811 



2 ) INFORMATION FOR SEQ ID NO: 284 



20 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
3 0 (A) ORGANISM: Enterobacter cloacae 

(B) STRAIN: ATCC 13 047 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284 

35 GCCGTACCAC GCGTGTACGA CGCGCTTGAG GTACAGAATG GTAACGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTGCGTACCA 100 

TCGCGATGGG GTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTTAAAGAC 150 

CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACAC TGGGTCGTAT 2 00 

CATGAACGTA TTGGGTCAGC CAATCGACAT GAAAGGCGAC ATCGGTGAAG 250 

40 AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CTTCCTACGA AGAGCTGTCC 300 

AGCTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTTGCG AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CCGTATTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

15 CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAC AAAGTATCCC 550 

TGGTTTACGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGCGTTGCG 600 

CTGACTGGTC TGACGATGGC TGAGAAGTTC CGTGACGAAG GCCGTGACGT 650 

TCTGCTGTTC GTTGATAACA TCTACCGTTA CACCCTGGCC GGTACGGAAG 700 

TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA TCAGCCTACG 750 

50 CTGGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT CTACCAAAAC 800 

CGGTTCTATC ACTTCCG 817 



55 2 ) INFORMATION FOR SEQ ID NO: 285 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 766 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

170 
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WO 01/23604 

(D) TOPOLOGY: Linear 



PCT/CA00/01150 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter gergoviae 

(B) STRAIN: ATCC 33028 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285 



10 



AATGAGAGCC TGGTGCTGGA AGTTCAGCAG CAGCTCGGCG GCGGTATCGT 50 

GCGTACCATC GCAATGGGTT CTTCTGACGG TCTGCGTCGC GGTCTGGAAG 100 

TTAAAGATCT CGAACATCCG ATCGAAGTCC CGGTAGGTAA AGCGACCCTC 150 

GGCCGTATTA TGAACGTGCT GGGTCAGCCG GTTGATATGA AAGGCGATAT 200 

15 CGG CGAAGAA GAGCGTTGGG CGATCCACCG CGCTGCGCCG TCCTATGAAG 250 

AGCTCTCCAG CTCTCAGGAA CTGCTGGAAA CCGGTATCAA GGTAATGGAC 3 00 

CTGATTTGCC CGTTCGCGAA GGGCGGTAAA GTCGGTCTGT TCGGCGGTGC 350 

GGGCGTTGGT AAAACCGTAA ACATGATGGA GCTGATCCGT AACATCGCGA 400 

TCGAGCACTC CGGCTACTCC GTGTTTGCGG GCGTGGGTGA ACGTACTCGT 450 

2 0 GAGGGTAACG ACTTCTACCA CGAAATGACC GACTCCAACG TTATCGACAA 500 

AGTATCCCTG GTGTACGGCC AGATGAACGA GCCGCCGGGA AACCGTCTGC 550 

GCGTGGCGCT GACCGGTCTG ACCATGGCTG AGAAATTCCG TGACGAAGGT 600 

CGTGACGTTC TGCTGTTCGT CGATAACATC TACCGCTATA CCCTCGCCGG 650 

TACTGAAGTA TCCGCACTGC TGGGCCGTAT GCCTTCTGCA GTAGGTTACC 700 

25 AGCCGACGCT GGCGGAAGAG ATGGGTGTTC TGCAGGAACG TATCACCTCC 750 

ACCAAAACCG GTTCTA 766 



30 2) INFORMATION FOR SEQ ID NO: 286 

(i) SEQUENCE CHARACTERISTICS : 

(A) - LENGTH: 805 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

a0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterojbacter hormaechei 

(B) STRAIN: ATCC 4 9162 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286 

45 

GCCGTACCAC GCGTGTACGA CGCGCTTGAG GTACAGAATG GTAACGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTGCGTACCA 100 

TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTGAAAGAC 150 

CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACGC TGGGTCGTAT 2 00 

50 CATGAACGTA TTGGGTCAGC CAATCGACAT GAAAGGCGAC ATCGGTGAAG 250 
AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CTTCCTACGA AGAGCTGTCC . 3 00 

AGCTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTTGCG AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 4 50 

55 TCCGGTTACT CCGTGTTTGC GGGCGTGGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAC AAAGTATCCC 550 

TGGTTTACGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGCGTTGCG 600 

CTGACTGGCC TGACGATGGC TGAGAAGTTC CGTGACGAAG GCCGTGACGT 650 

TCTGCTGTTC GTCGATAACA TCTACCGTTA CACCCTGGCC GGTACGGAAG 700 

50 TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA TCAGCCAACG 750 
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CTTGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT " CGACCAAAAC 800 
CGGTT 805 



5 



2) INFORMATION FOR SEQ ID NO: 287 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 791 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter sakasakii 

(B) STRAIN: ATCC 29544 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287 

TACGACGCCC TTGAGGTAAC GAATGGTAAT GAGCGTCTGG TGCTGGAAGT 50 

CCAGCAGCAG CTCGGCGGCG GTATCGTACG TACCATCGCG ATGGGCTCTT 100 

CCGACGGTCT GCGTCGCGGT CTGCCTGTTG CAGACCTTGA GCACCCGATC 150 

25 GAAGTGCCGG TAGGTAAAGC GACGCTGGGT CGTATCATGA ACGTCCTGGG 2 00 

TCAGCCTATC GACATGAAAG GCGACATCGG CGAAGAAGAG CGTTGGGCGA 250 

TTCATCGCGC GGCGCCGTCC TATGAAGAGC TGTCCAGCTC TCAGGAACTG 300 

CTGGAAACCG GCATCAAAGT TATCGACCTG ATGTGTCCGT TCGCGAAGGG 350 

CGGTAAAGTC GGTCTGTTCG GTGGTGCAGG TGTAGGTAAA ACCGTAAACA 400 

30 TGATGGAGCT TATTCGTAAC ATCGCGATTG AGCACTCCGG TTACTCCGTG 450 

TTTGCGGGCG TGGGCGAACG TACCCGTGAA GGTAACGACT TCTACCACGA 500 

AATGACCGAC TCCAACGTAC TGGATAAAGT ATCCCTGGTG TACGGCCAGA 550 

TGAACGAGCC GCCGGGAAAC CGTCTGCGCG TTGCGCTGAC CGGCCTGACC 600 

ATGGCTGAGA AATTCCGTGA CGAAGGTCGT GACGTTCTGC TGTTCGTCGA 650 

35 CAACATCTAC CGTTACACCC TGGCCGGTAC TGAAGTATCC GCACTGCTGG 700 

GCCGTATGCC TTCAGCGGTA GGTTATCAGC CGACCCTGGC GGAAGAGATG 750 

GGTGTTCTGC AGGAGCGTAT CACCTCCACC AAAACCGGTT C 791 



40 



50 



2) INFORMATION FOR SEQ ID NO: 288 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 839 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus avium 

(B) STRAIN: ATCC 14025 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288 

TTTCTTTAGA TCAATCCTTA CCAGACATCA ACAATGCGTT GATTGTTTAC 50 

AAAAAAGATA AAACAAAAGT TGTTCTTGAA GTTGCTTTGG AACTTGGTGA 100 

TGGTGTTATC CGCACAATCG CTATGGAGGC TACTGATGGA TTGCAACGTG 150 

60 GAATGGAAGT TGTCGATACT GGCAAATCAA TCTCCGTTCC TGTAGGTAAA 200 
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GATACTCTAG GTCGTGTGTT TAACGTATTA GGTGAAACGA TTGATAAAGA 250 

AGCACCTTTT CCAGAAGATG CAGAAAGAAG CGGCATTCAT AAAAAGGCGC 3 00 

CTGCTTTTGA AGACCTTAGT ACAAGTAACG AGATTTTGGA AACAGGGATC 350 

AAGGTTATCG ACTTATTAGC CCCTTATTTA AAAGGTGGGA AAGTCGGACT 400 

5 ATTCGGTGGT GCCGGTGTTG GTAAAACCGT TTTGATCCAA GAATTAATTC 450 

ATAATATCGC CCAAGAACAC GGTGGTATTT CAGTGTTTAC CGGTGTTGGG 500 

GAACGTACTC GTGAAGGGAA CGACCTTTAT TATGAAATGA AAGACTCTGG 550 

CGTTATTGAG AAAACAGCCA TGGTGTTCGG ACAAATGAAC GAGCCGCCTG • 600 

GTGCACGTAT GCGTGTTGCC TTGACTGGTT TGACATTAGC TGAATATTTC 650 

10 CGTGATGAAG AAGGACAAGA TGTGTTGCTA TTTATTGACA ACATCTTCCG 700 

CTTTACTCAA GCCGGATCAG AAGTTTCTGC CTTATTAGGA CGTATGCCAT 750 

CAGCCGTTGG GTATCAACCA ACTTTGGCAA CTGAAATGGG GCAATTACAA 800 

GAACGAATCA CTTCAACCAA AAAAGGTTCG ATCACTTCA 839 



15 
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2 ) INFORMATION FOR SEQ ID NO: 2 89 



(i) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 847 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseli flaws 

(B) STRAIN : ATCC 25788 

r (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289 



•TTCTCTCTAG ACCAA?TCATT ACCAGATATC AACAATGCGT TGATTGTTTA 50 

CAAAAAAGAT GAGCAGAAAA CAAAAGTTGT GTTAGAAGCT GCCTTAGAAC 100 

35 TTGGCGACGG CGTTATCCGT ACGATTGCCA TGGAATCAAC AGATGGCTTA 150 

CAACGAGGAA TGGAAGTAAT CGATACAGGC GCCTCCATTT CTGTTCCAGT 2 00 

TGGGACAGAA ACCTTAGGAC GGGTGTTTAA TGTCTTAGGG GACACCATCG 250 

ATTTAGAAGC GCCGTTCCCT GAAGAAGCAC CCCGCAGTGG GATTCACAAA 300 

AAAGCACCTG ACTTTGATGA ATTGTCAACA AGTACGGAGA TCCTTGAAAC 350 

10 TGGGATCAAA GTTATCGATT TGTTAGCCCC TTATTTAAAA GGGGGGAAAG 400 

TTGGACTTTT CGGTGGTGCC GGTGTTGGTA AAACCGTCTT GATCCAAGAA 450 

TTGATCCACA ACATCGCCCA AGAGCATGGT GGGATCTCTG TCTTCACAGG 500 

TGTTGGTGAA CGGACACGTG AAGGAAATGA CCTTTATAAT GAAATGAAAG 550 

AATCTGGCGT TATCGAAAAA ACAGCCATGG TGTTTGGACA AATGAACGAA 600 

45 CCACCAGGTG CTCGGATGCG GGTAGCCTTG ACTGGTTTGA CATTAGCCGA 650 

GTACTTCCGT GATGTGGAAG GACAAGACGT GCTCTTGTTT ATCGATAATA 700 

TCTTCCGCTT C ACT CAAG C A GGTTCTGAAG TATCTGCCTT ACTAGGTCGG 750 

ATGCCGTCTG CCGTTGGGTA TCAGCCAACA TTAGCAACTG AGATGGGGCA 800 

ATTACAAGAA CGGATCACAT CGACGAAGAA AGGTTCCGTT ACGTCTA 847 



2) INFORMATION FOR SEQ ID NO: 290 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 
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PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus durans 
5 (B) STRAIN: ATCC 19432 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290 

TTTCTTTAGA CCAATCCTTA CCAGATATCA ACAACGCTTT AGTTGTTTAT 50 

10 AAAAATGATG AGAAGAAATC GAAAGTTGTT CTTGAAACAG CGCTAGAATT 100 

AGGTGACGGT GTCATCCGTA CAATCGCGAT GGAATCAACA GATGGTTTAC 150 

AACGCGGAAT GGAAGTCATT GATACAGAAA AAGCAATTTC TGTACCAGTG 2 00 

GGTAAAGAAA CGTTAGGTCG TGTATTCAAT GTATTAGGAG ATACGATCGA 250 

TTTATCTGCA CCTTTCCCAG AAGATGCAAA ACGTAGCGAA ATCCATAAAA 3 00 

15 AAGCACCAAA CTTTGATGAG TTAAGTACAA GTACTGAGAT CCTTGAAACT 350 

GGGATCAAAG TTATTGACTT GCTTGCTCCT TACTTAAAAG GTGGGAAAGT 400 

TGGATTATTC GGTGGTGCCG GTGTAGGTAA AACTGTATTG ATCCAAGAAT 450 

TGATCCATAA TATCGCTCAA GAACACGGTG GTATTTCTGT ATTTACTGGT 500 

GTTGGTGAAC GTACACGTGA AGGTAATGAC CTTTATTATG AAATGAAAGA 550 

20 TTCAGGAGTT ATTGAAAAAA CAGCCATGGT GTTTGGTCAA ATGAACGAAC 600 

CACCAGGTGC ACGTATGCGT GTTGCCTTGA CTGGTTTGAC GATTGCTGAA 650 

TACTTCCGTG ATGTTGAAGG GCAAGACGTG CTATTGTTTA TTGATAATAT 700 

TTTCCGTTTC ACTCAAGCCG GTTCAGAAGT TTCTGCCCTA TTAGGTCGTA 750 

TGCCTTCTGC CGTTGGGTAC CAACCAACGC TAGCAACAGA AATGGGTCAA 800 

25 TTACAAGAAC GGATCACTTC AACGAAAAAA GGTTCAATCA CTTCA 845 



2) INFORMATION FOR SEQ ID NO : 291 

(i) SEQUENCE CHARACTERISTICS: 

(A) . LENGTH: 840 bases 

(B) ' TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: ATCC 29212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291 

15 TTAGATCAAT CCTTACCCGA TATTAACAAC GCTTTAGTCG TTTATAAAAA 50 

TGGCGAAGCA AAACAAAAAG TAGTACTTGA AGTCGCTTTA GAACTAGGTG 100 

ATGGAGTGAT TCGTTCTATC GCCATGGAAT CGACAGATGG TTTACAACGT 150* 

GGAATGGAAG TTATCGATAC AGGAAAATCA ATTTCAGTTC CTGTTGGTAA 200 

AGATACATTA GGTCGTGTGT TTAACGTTTT AGGAGACACA ATTGACTTAG 250 

50 AAGCGCCATT CCCTGCAGAT GCTGAACGTA GTGGGATTCA TAAAAAAGCG 3 00 

CCAGCATTTG ATGAATTAAG TACCAGTAAT GAAATTTTAG AAACAGGGAT 350 

TAAAGTTATT GACTTATTAG CACCTTATCT AAAAGGTGGT AAAGTCGGAC 400 

TTTTCGGTGG TGCCGGTGTT GGTAAAACCG TCTTAATTCA AGAATTAATT 450 

CATAATATTG CCCAAGAACA TGGAGGGATT TCCGTCTTTA CTGGTGTTGG 500 

55 TGAACGGACA CGTGAAGGGA ACGATCTGTA CTATGAAATG AAAGATTCAG 550 

GCGTTATTGA AAAAACAGCC ATGGTTTTTG GTCAAATGAA CGAACCGCCA 600 

GGTGCACGGA TGCGTGTGGC CTTAACTGGG TTAACGATTG CTGAATATTT 650 

CCGTGATGTG GAAGGACAAG ACGTGCTATT ATTTATTGAT AACATTTTCC 700 

GTTTCACCCA AGCCGGTTCA GAAGTTTCTG CCCTTTTAGG TCGGATGCCG 750 

50 TCAGCCGTTG GTTACCAACC AACCTTAGCG ACTGAAATGG GACAATTACA 800 
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AGAACGGATT ACTTCAACGA AAAAAGGATC AATTACCTCT 



PCT/CA00/01150 

840 



5 2) INFORMATION FOR SEQ ID NO: 292 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: ATCC 19434 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292 



TTTCTTTAGA CCAATCCTTA CCAGATATCA ACAACGCATT AGTTGTTTAT 50 

AAAAATGACG AAAATAAATC AAAAGTTGTT CTTGAAGCAG CGTTAGAATT 100 

AGGGGACGGA GTGATCCGGA CCATTGCGAT GGAATCAACA GATGGTTTAC 150 

AAAGAGGAAT GGAAGTCATT GATACAGGCA AAGCAATCTC TGTTCCTGTA 200 

25 GGTAAAGAAA CATTAGGTCG CGTATTCAAC GTACTAGGAG ATACGATCGA 2 50 

TTTAGAAACA CCTTTCCCAG AAGATGCGGA AAGAAGCGAA ATTCATAAAA 300 

AAGCACCAGC CTTTGACGAA TTAAGTACAA GTACAGAAAT TTTGGAAACA 350 

GGGATCAAAG TTATCGATTT GCTTGCCCCA TATTTAAAAG GTGGGAAAGT 400 

CGGACTATTC GGTGGTGCCG GTGTTGGTAA AACCGTACTG ATCCAAGAAC 450 

30 TGATCCATAA TATCGCCCAA GAACATGGTG GTATTTCTGT ATTTACCGGT 500 ' 

GTAGGTGAAC GTACTCGTGA AGGTAATGAC TTGTATTATG AAATGAAAGA 550 

TTCAGGAGTT ATCGAAAAAA CAGCCATGGT GTTCGGACAA ATGAACGAAC 600 

CACCAGGTGC ACGTATGCGT GTTGCTTTGA CTGGTTTGAC GATTGCGGAA 650 

TATTTCCGTG ATGTAGAAGG TCAAGATGTA CTGTTGTTTA TCGACAACAT 700 

35 TTTCCGTTTC ACTCAAGCTG GATCTGAAGT ATCAGCCTTG TTAGGACGGA 750 

TGCCTTCTGC GGTTGGTTAT CAACCAACAT TGGCAACAGA AATGGGTCAA 800 

TTGCAAGAAC GTATCACATC TACGAAAAAA G 831 



40 



50 



2) INFORMATION FOR SEQ ID NO: 2 93 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 826 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: ATCC 49573 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293 

CCTTACCAGA TATCAACAAT GCGTTGATCG TTTACAAAAA AGACGAGAAA 50 

AAAACAAAAG TAGTATTGGA AGCCGCTTTA GAACTAGGGG ATGGTGTGAT 10 0 

CCGCACCATC GCAATGGAAT CTACAGACGG TTTGCAACGA GGAATGGAAG 150 

60 TCATCGATAC CGGTGCCTCA ATCTCTGTCC CTGTAGGAAC AGATACTCTA 200 

175 
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GGCCGAGTAT TTAATGTACT AGGCGATACT ATCGACTTGG AAGCACCAT^ ' 25^ 

CCCAGAAGAT GCCAAACGTA GTGGCATCGA CAAAAAAGCC CCAGATTTCG 300 

ATGAATTGTC AACAAGTACA GAAATCCTTG AAACTGGGAT CAAAGTTATC 350 

GATTTATTAG CTCCTTACTT AAAAGGTGGT AAAGTCGGCT TGTTCGGTGG 400 

5 TGCCGGTGTT GGTAAAACCG TATTGATTCA AGAATTGATT CACAATATCG 450 

CTCAAGAGCA TGGGGGAATT TCAGTATTTA CCGGTGTTGG CGAACGGACG 500 

CGTGAAGGTA ATGACTTGTA TTATGAAATG AAAGAATCAG GCGTTATCGA 550 

AAAGACAGCC ATGGTTTTCG GTCAAATGAA TGAACCACCA GGTGCCCGGA 600 

TGCGGGTTGC TTTGACTGGT TTGACCATTG CTGAGTATTT CCGTGACGTT 650 

10 GAAGGACAAG ATGTGCTCTT GTTTATCGAT AATATTTTCC GTTTCACACA 700 

AGCGGGTTCT GAAGTATCTG CCTTGTTAGG CCGGATGCCA TCAGCCGTTG 750 

GTTATCAACC AACTCTAGCA ACTGAAATGG GTCAATTACA AGAACGAATC 800 

ACTTCTACGA AAAAAGGATC TGTAAC 826 



15 
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2) INFORMATION FOR SEQ ID NO: 294 



( i ) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus Baccharolyticus 

(B) STRAIN: ATCC 43076 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294 ' 



TTTCTTTGGA CCAATCCTTA CCAGACATCA ACAATGCGTT /GGTGGTCTAT 50 

AAAAAGAATG ATGAAAAAAC AAAAGTGGTA CTTGAAACAG CTTTAGAACT 100 

35 TGGTGATGGT GTCGTACGTA CGATTGCTAT GTCGTCAACA GACGGTTTGC 150 

AACGTGGGAT GGAAGTCATC GATACAGGAG CATCAATTTC TGTTCCTGTT 200 

GGGAAAGAGA CATTAGGACG TGTATTTAAC GTTTTAGGGG AGACTATCGA 250 

CTTAGATGGT CCATTCCCAG AAGAAGTAGC ACGAGATGGT ATTCATAAAA 3 00 

AGGCACCTGA TTTTGATGAA TTAAGTACAA GTACGGAGAT TCTTGAAACA 350 

40 GGGATTAAAG TAATCGATTT ATTAGCGCCT TAC TTAAAAG GTGGGAAAGT 400 

TGGTTTATTC GGTGGTGCCG GTGTAGGTAA AACGGTATTA ATTCAAGAAT 450 

TGATTAACAA TATTGCGCAA GAACATGGTG GTATTTCAGT ATTTGCGGGT 500 

GTTGGTGAGC GTACTCGTGA AGGAAATGAC CTTTATTATG AAATGAAAGA 550 

GTCGGGCGTT ATTGAGAAAA CAGCGATGGT TTTTGGACAA ATGAACGAAC 600 

15 CACCAGGTGC ACGTATGCGA GTTGCTTTAA CTGGTTTAAC CATTGCAGAA 650 

TACTTCCGTG ATGTTGAAGG ACAAGATGTA TTACTATTTA TTGATAACAT 700 

TTTCCGTTTT ACTCAAGCTG GTTCAGAAGT TTCAGCTTTA TTAGGACGTA 750 

TGCCTTCAGC GGTAGGGTAT CAACCGACAT TAGCAACAGA AATGGGACAA 800 

TTACAAGAAC GTATTACGTC AACGAAAAAA GGCTCAATTA CATCAA 846 

50 



2) INFORMATION FOR SEQ ID NO: 295 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) -LENGTH: 803 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia fergusonii 
5 (B) STRAIN: ATCC 35469 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 95 

GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTGCAAAATG GTAATGAGCG 50 

10 TCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGTGGTATC GTGCGTACCA 100 

TCGCAATGGG GTCCTCCGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAC 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCGACTC TGGGCCGTAT 200 

CATGAACGTA CTGGGTGAAC CGGTCGACAT GAAAGGCGAG ATCGGTGAAG 250 

AAGAGCGTTG GGCGATTCAC CGCGCAGCAC CTTCCTACGA AGAGCTGTCA 3 00 

15 AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 3 50 

TCCGTTCGCT AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACTGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CTGACTCCAA* CGTTATCGAC AAAGTATCCC 550 

20 TGGTATATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCA 600 

CTGACCGGCC TGACCATGGC TGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

TCTGCTGTTC GTTGACAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGCCGT ATGCCTTCAG CGGTAGGTTA TCAGCCGACT 750 

CTGGCGGAAG AGATGGGCGT TCTTCAGGAA CGTATCACCT CCACCAAAAC 800 

25 TGG 803 



2) INFORMATION FOR SEQ ID NO: 2 96 



30 



: (i) SEQUENCE CHARACTERISTICS': 

(A) LENGTH: 822 bases 

■ (B) TYPE: Nucleic acid. 
(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Escherichia hermannii 

(B) STRAIN: ATCC 33650 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296 

45 GATGCCGTAC CGCGCGTGTA CGATGCTCTT GAGGTGCAAA ATGGTGATGA 50 

GCGTCTGGTG CTGGAAGTGC AGCAGCAGCT CGGCGGCGGT ATCGTGCGTA 100 

CCATCGCAAT GGGTTCTTCC GACGGTCTGC GTCGTGGTCT GACTGTCGTC 150 

GACCTCGAGC AC C CGATCGA AGTCCCGGTA GGTAAAGCGA CCCTGGGCCG 2 00 

TATCATGAAC GTGCTGGGTC AGCCGATCGA CATGAAAGGC GATATCGGTG 250 

50 AAGAAGAGCG TTGGGCGATT CACCGCGCGG CGCCGTCCTA TGAAGAGCTG 3 00 

TCCAGCTCTC AGGAACTGCT GGAAACCGGC ATCAAAGTTA TCGACCTGAT 350 

GTGTCCGTTC GCGAAGGGCG GTAAAGTCGG TCTGTTCGGT GGTGCGGGCG 400 

TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ACTCTGTGTT TGCGGGCGTG GGTGAACGTA CTCGTGAGGG 500 

55 TAACGACTTC TACCATGAAA TGACCGACTC CAACGTTCTG GACAAAGTAT 550 

CCCTGGTTTA CGGCCAGATG AACGAACCGC CGGGAAACCG TCTGCGCGTT 600 

GCACTGACCG GCCTGACCAT GGCTGAGAAA TTCCGTGACG AAGGTCGTGA 650 

CGTTCTGTTG TTCGTCGACA ACATCTACCG TTACACCCTG GCCGGTACTG 700 

AAGTATCCGC ACTGCTGGGC CGTATGCCTT CTGCGGTAGG TTACCAGCCG 750 

50 ACCCTGGCGG AAGAGATGGG CGTTCTGCAG GAGCGTATCA CCTCCACCAA 800 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



AACCGGTTCT ATCACCTCCG TA 



2) INFORMATION FOR SEQ ID NO: 297 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 808 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia vulneris 

(B) STRAIN: ATCC 33821 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297 

CCGAACGTGT ACGACGCCCT CGAAGTGACA AATGGTAATG AGCGTCTGGT 50 

GCTGGAAGTT CAGCAGCAGC TCGGCGGCGG TATCGTACGT ACCATCGCTA 100 

TGGGTTCTTC CGACGGTCTG CGTCGTGGTC TGGAAGTTCA GGACCTCGAG 150 

CACCCGATCG AAGTGCCGGT AGGTAAAGCG ACCCTGGGTC GTATCATGAA 2 00 

CGTACTGGGT CAGCCGATCG ATATGAAAGG CGACATCGGT GAAGAAGAGC 250 

GTTGGGCTAT TCACCGTGCA GCACCGTCCT ATGAAGAGCT CTCCAGCTCT 3 00 

CAGGAACTGC TGGAAACCGG CATCAAGGTT ATCGACCTGA TGTGTCCGTT 350 

CGCCAAGGGC GGTAAAGTCG GCCTGTTCGG CGGCGCGGGC GTGGGTAAAA 400 

CCGTAAACAT GATGGAGCTG ATCCGTAACA TCG CGATCGA GCACTCCGGT 4 50 

TACTCCGTGT TTGCAGGCGT GGGTGAGCGT ACTCGTGAGG GTAACGACTT 500 

CTACCACGAG ATGACCGACT CCAACGTTCT ' GGACAAAGTA TCCCTGGTGT 550 

ACGGCCAGAT GAACGAGCCG CCGGGAAACC GTCTGCGCGT GGCACTGACC 600 

GGCCTGACCA TGGCTGAGAA GTTCCGTGAC GAAGGTCGTG ACGTTCTGCT 650 

GTTCGTTGAC AACAT CTATC GTTACACCCT GGCCGGTACG GAAGTATCTG 700 

CACTGCTGGG CCGTATGCCT TCAGCGGTAG GTTACCAGCC GACGCTGGCG 750 

GAAGAGATGG GCGTTCTGCA GGAGCGTATC ACCTCCACCA AAACCGGTTC 800 

TATCACCT 808 



2) INFORMATION FOR SEQ ID NO: 298 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 843 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
'(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: jEuJbacteriuiri lentum 

(B) STRAIN: ATCC 43055 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298 



TTTCCCCCTG ATCAGCTGCC GGCGATTTAC AACGCGCTGA CGGTTGATGC 
CAAGACCCTG GCGGGCGACT TGCACCTCGT GCTCGAGGTC GAGACGCACC 
TGCCGGGCAA CCTTGTCCGC TCGGTGGCCA TGAGCTCGAC GGACGGTCTC 
GTCCGCGGCC TCGAGGTCGT CGACACGGGC AACCCGATCA TGATGCCCGT 



50 
100 
150 
2 00 
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GGGTCCCGAG 
ACGAGAAGCC 
GCTCCGGACT 
CATCAAGGCC 
GTCTGTTCGG 
ATCAACAACC 
GGGCGAGCGT 
CGGGCGTCAT 
CCGGGAGCGC 
CTTCCGCGAT 
GCTTCACGCA 
TCTGCCGTGG 
GGAGCGCATC 



ACCCTGGGTC 
GATGCCCGAG 
ACGACGAGCT 
ATCGACCTCG 
CGGCGCCGGC 
TGGCCCAGGA 
ACCCGCGAGG 
CAACAAGACC 
GTCTGCGCGT 
CAGGGCCAGG 
GGCCGGCTCC 
GTTACCAGCC 
ACGTCGACGT 



GCATCTGGAA 
GTGAAGGGCT 
GTCCACCACC 
TCGAGCCCTT 
GTGGGCAAGA 
GCACGGCGGC 
GTACCGACCT 
TGCCTCGTGT 
GGGTCTCGCG 
ACGTGCTTCT 
GAGGTGTCCG 
GACGCTGGCA 
CCACCGGCTC 



CGTCATGGGC GAGCCCGTCG 
ACATGCCCAT CCACCGTCCG 
ACCGAGATCT TCGAGACCGG 
CGTCAAGGGC GGCAAGACGG 
CGGTTATCAT CCAGGAGCTC 
ACGTCGGTGT TCACGGGCGT 
CTACCTGGAG ATGAGCGACT 
ACGGTCAGAT GAACGAGCCT 
GGCCTCACCG AGGCGGAGTA 
GTTCGTGGAC AACATCTTCC 
CTCTGCTGGG CCGCATGCCC 
ACCGAGATGG GCGACCTGCA 
CATCACGTCC GTG 



250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
843 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 299 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) 1 ORGANISM: Ewingella americana 

(B) STRAIN: ATCC 33852 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO": 299 

TCCCTCAGGA TGCAGTACCG AACGTGTACA ATGCTCTTGA GGTAGAAAAC 50 

GGTGCCTCCA AACTGGTTCT GGAAGTTCAG CAACAGTTAG GCGGCGGCGT 100 

TGTTCGTTGT ATCGCAATGG GTACCTCAGA CGGCCTTCGT CGCGGTCTGA 150 

AAGTGAACAA CCTGGAACAC CCAATTGAAG TTCCGGTTGG TAAAGCGACT 2 00 

CTGGGTCGTA TCATGAACGT ATTGGGTGAA CCAATCGACA TGAAAGGTGA 250 

AATCGGCGAA GAAGAACGTC GTGCAATTCA CCGTCCAGCG CCTTCTTATG 3 00 

AAGAGCTGGC TAACTCCCAA GAATTGCTGG AAACCGGTAT CAAAGTTATG 350 

GACCTGATGT GTCCGTTCGC TAAGGGCGGT AAAGTCGGTC TGTTCGGTGG 400 

TGCGGGTGTT GGTAAAACTG TAAACATGAT GGAGCTGATC CGTAACATCG 450 

CGATCGAGCA CTCCGGTTAC TCAGTGTTTG CAGGCGTGGG TGAGCGTACT 500 

CGTGAGGGTA ACGACTTCTA CCACGAAATG ACTGACTCCA ACGTTATCGA 550 

CAAAGTTTCC CTGGTCTATG GTCAGATGAA TGAGCCACCA GGTAACCGTC 600 

TGCGCGTTGC ACTGACCGGC CTGACCATGG CGGAGAAATT CCGTGATGAA 650 

GGTCGTGACG TACTGCTGTT CGTTGACAAC ATTTACCGTT ACACCCTGGC 700 

AGGTACCGAA GTGTCCGCAC TTCTGGGCCG TATGCCATCG GCGGTAGGTT 750 

ATCAGCCAAC GCTGGCGGAA GAGATGGGTG CTCTGCAAGA GCGTATCACC 800 

TCTACCAAAA GTGGTTCTAT CACCTCCGT 829 



2) INFORMATION FOR SEQ ID NO: 300 

( i ) SEQUENCE CHARACTER I STICS : 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(vi) ORIGINAL SOURCE: 

(A) ORGANI SM : Franci sel 1 a tularensi s 
5 (B) STRAIN: LVS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300 

AACACGCCTA AAGTATATGA TGCTTTAAAT GTAGTAGAAG CTGGTTTAGT 50 

L0 ATTAGAAGTT CAGCAACAAA TTGGTGATGG CGTAGTTCGT ACAATTGCTA 100 

TGGGATCTAG TGATGGTCTT AGACGTGGTA TGGAAGTTAA GAACACAAAT 150 

GCGCCTATTT CTGTTCCAGT TGGACATGGC ACACTTGGAC GTATCATGAA 200 

TGTTTTAGGT GAACCAATTG ATGAAGCTGG TCCAATTGAA TATACTGAGA 2 50 

AAAGATCTAT CCATCAAGCT CCTCCTGCAT ATGATGAGTT AGCATTAAGT 3 00 

L5 ACAGAAATAT TAGAAACAGG TATCAAAGTA GTTGACCTTA TTTGTCCATT 3 50 

TGCTAAGGGC GGTAAAGTTG GTTTATTTGG CGGTGCAGGT GTTGGTAAAA 400 

CTGTAACGAT GATGGAACTT ATCAACAATA TTGCAAAAGA ACATAGTGGC 450 

TACTCTGTAT TTTCCGGTGT TGGTGAAAGA ACTCGTGAAG GTAATGACTT 500 

CTACTATGAG ATGAAATATT CTAATGTATT GGATAAAGTA TCATTAGTAT 550 

20 ATGGTCAGAT GAATGAGCCG CCTGGAAACA GATTAAGAGT AGCTCTTAGT 600 

GGCTTAACAA TAGCAGAAGG ATTCCGTGAT GAAAAGCGTG ATGTTTTGAT 650 

GTTTATCGAT AACATCTATC GTTATACATT AGCAGGTACA GAGGTATCGG 700 

CGCTACTTGG TCGTATGCCA TCTGCTGTGG GTTATCAGCC AACGCTTGCA 750 

GCTGAGATGG GTGCTTTACA GGAGCGTATT ACATCTACTA AGACAGGATC 800 

25 TATTA 805 



2 ) INFORMATION FOR SEQ ID NO: 301 



30 



( i ) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 825 bases 

(B) ./TYPE: Nucleic acid J < 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Fusobacterium gonidiaformans 

(B) STRAIN: ATCC 25563 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301 

45 GACGAATTGC CAAAAATATA CAATGCATTA AAGGTGCAAG TTGGAGAAAA 50 

AGAACTTGTA TTGGAAGTGC AACAACATTT GGGAAATAAT GTTGTGAGAA 100 

CAGTAGCGAT GGACTCAACA GATGGATTGC TTCGAGGAAT GGAAGTAATG 150 

GATACCGGAG CACCGATTAC TGTTCCAGTA GGGAAGGCGG TTTTAGGAAG 200, 

AATATTGAAT GTTTTGGGAG AGCCTGTGGA TCAAAAAGGG CCTGTGGAAA 250 

50 CAGAAGAATA TTTACCTATC CATAGAGAAG CACCAAAATT TGAAGAACAA 3 00 

GAAACAGTAA CAGAAATTTT TGAAACAGGA ATTAAAGTCA TAGATTTGTT 350 
AGCCCCTTAT ATCAAAGGAG GAAAGACAGG TCTATTCGGT GGAGCCGGAG ' 4 00 

TAGGGAAAAC AGTTTTAATT ATGGAATTAA TTAATAACAT TGCAAAGGGC 450 

CACGGAGGAA TTTCTGTGTT TGCAGGAGTT GGAGAAAGAA CAAGAGAAGG 500 

55 AAGAGATTTA TACAACGAAA TGACAGAGTC CGGAGTTTTG AATAAGACCT 550 

CGTTGGTGTA TGGTCAAATG AATGAGCCGC CCGGAGCAAG ACTTCGTGTG 600 

GCGTTGACAG GATTAACGGT TGCTGAAAAC TTTAGAGATA AAGAAGGGCA 650 

AGATGTATTG TTGTTTATCG ACAATATCTT CCGTTTCACA CAAGCAGGAT 700 

CAGAAGTATC GGCTCTATTG GGAAGAATTC CATCGGCAGT AGGATATCAA 750 

50 CCGAACTTAG CGACAGAAAT GGGAACTTTA CAAGAAAGAA TTACTTCTAC 800 
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5 2) INFORMATION FOR SEQ ID NO: 302 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusobacterium necrophorum subsp. 
necrophorum 

(B) STRAIN: ATCC 25286 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302 

ACAATGCATT AAAGGTACAG GTGGGAGAAA GGGAACTTGT GTTGGAAGTG 50 

CAGCAACATT TAGGAAATAA TGTTGTCAGA ACAGTAGCAA TGGATTCAAC 100 

AGACGGATTA CTTCGGGGAA TGGAAGTGAG AGATACAGGA GTTCCCATTA 150 

25 CTGTTCCGGT AGGAAAGGCG GTTTTGGGAA GAATATTAAA TGTCTTAGGG 200 

GAGCCTGTGG ACGAAAAAGG TCCGATAGAG ACAGAAGAAT ATTTACCAAT 250 

ACATAGAGAA GCACCGAAAT TTGAAGAACA GGAAACGGTG ACAGAAATTT 300 

TTGAAACAGG AATTAAAGTC ATTGATTTGT TAGCTCCTTA TATTAAAGGA 3 50 

GGAAAAACAG GCCTATTCGG AGGAGCCGGA GTAGGAAAAA CCGTTTTGAT 4 00 

30 TATGGAACTG ATCAATAATA TTGCAAAAGG TCATGGAGGA ATTTCTGTTT 450 

. TTGCAGGAGT TGGAGAAAGA ACGAGAGAGG GAAGAGATCT ATACAACGAA 5.0.0 

ATGACAGAGT CCGGAGTTTT GAATAAAACT TCTTTGGTAT ATGGGCAAAT 550 

GAATGAGCCG CCCGGAGCAA GACTTCGAGT GGCTTTAACC GGACTTACTG 600 

TTGCCGAAAA TTTCAGAGAT AAAGAGGGAC AGGATGTCTT ATTGTTCATT 650 

35 GACAATATTT TCCGTTTCAC ACAAGCAGGT TCGGAAGTAT CGGCACTTTT 700 

GGGGAGAATT CCTTCTGCAG TGGGATATCA ACCGAACTTG GCGACAGAAA 750 

TGGGAAGCTT ACAAGAAAGA ATTACTTCTA CAAAATCCGG TTCTATCACT 800 

TCCGTG 806 



40 



2) INFORMATION FOR SEQ ID NO: 303 

( i ) SEQUENCE CHARACTER I STICS : 
45 (A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 (ii) MOLECULE TYPE: Genomic DNA 



55 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: FusoJbacteriiun nucleatum subsp. polymorphum 

(B) STRAIN: ATCC 10953 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303 



GATGAATTGC CTGCAATATA TAATGCTTTA AAAGTAAAAT TAGAAGATAA 50 

GGAACTTGTT CTAGAAGTTG AACAACATCT TGGTAACAAT GTTGTAAGAA 100 

50 CTGTTGCTAT GGATTCAACT GATGGATTAA AAAGAGGAAT GGAAGTTATA 150 

181 
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GATACAGGTA AACCAATTAC AGTACCAGTT GGTAAAGCTG TTCTTGGTAG 200 

AATATTAAAT GTTTTAGGAG AACCTGTTGA TAATCAAGGT CCTATAAATG 250 

CTGAAACATT TTTACCTATT CATAGAGAAG CACCAGAATT TGATGACTTA 3 00 

GAAACTGAAA CTGAAATATT TGAAACAGGA ATAAAAGTTA TAGACTTATT 350 

5 AGCACCATAT ATTAAAGGTG GAAAAATAGG ATTATTTGGT GGAGCTGGAG 400 

TAGGAAAAAC AGTTTTAATA ATGGAACTTA TCAACAACAT TGCAAAAGGA 4 50 

CATGGAGGAA TTTCAGTTTT TGCAGGAGTT GGAGAAAGAA CAAGAGAAGG 500 

TAGAGACTTA TATGGTGAAA TGACTGAATC AGGAGTTATC ACAAAAACAG 550 

CTCTTGTTTA TGGACAAATG AATGAGCCAC CTGGAGCAAG ACTTAGAGTT 600 

10 GCATTAACAG GGCTTACTGT TGCAGAAAAC TTTAGAGATA AAGATGGGCA 650 

AGATGTTCTT CTATTTATAG ATAATATATT TAGATTTACA CAAGCAGGTT 700 

CAGAAGTTTC AGCTTTACTT GGAAGAATAC CATCAGCTGT TGGATATCAA 750 

CCAAACCTAG CAACTGAAAT GGGTGCTTTA CAAGAAAGAA TAACATCTAC 800 

AAAATCTGGT TCAATTACAT C 821 

15 



2) INFORMATION FOR SEQ ID NO: 304 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 864 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gardnerella vaginalis 
30 (B) STRAIN: ATCC 49145 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304 

TTCCCAGTTG GCTATCTTCC AGATATTTAT AATGCTCTCA AGGTTGATAT 50 

35 CAACACCGTT GGAAACACGG AGGGAGATAC CGTCCACGAG ATTACATTGG 100 

AAGTTGAGCA GCACCTTGGT GATTCAACTG TGCGAGCAGT GGCACTTAAG 150 

CCTACGGACG GCTTGGTCCG TGGTGCTTTA GTGCGAGATA CTGGTGGCCC 200 

AATTTCTGTG CCTGTTGGAG ATGTTACAAA AGGTCACGTT TTTGACGTAA 250 

CTGGTAACAT TTTAAACGCT AAACCAGGCG AAAACATTGA GGTGACCGAG 300 

40 CGCTGGCCAA TCCACCGCAA CCCACCTGCT TTCGATCAGC TTGAGTCTAA 350 

GACTCAAATG TTTGAAACAG GCATTAAGGT TATCGATTTG CTTACGCCTT 400 

ACGTTCAGGG CGGAAAGATT GGTCTGTTCG GTGGTGCAGG CGTTGGTAAA 450 

ACTGTGTTGA TTCAGGAGAT GATTCAGCGC GTTGCACAGA ACCACGGCGG 500 

TGTGTCTGTG TTTGCTGGCG TTGGCGAACG TACTCGTGAG GGTAACGATT 550 

45 TGATTGGCGA AATGGCTGAG GCTGGCGTTT TGGAGAAAAC AGCGCTTGTC 600 

TTTGGTCAGA TGGATGAGCC TCCTGGGACT CGTCTTCGTG TGCCTCTTAC 650 

TGCTTTGACT ATGGCTGAGT ATTTCCGTGA TGTTCAGAAT CAGGATGTGT 700 

TGCTGTTTAT CGACAACATC TTCCGCTTTA CTCAGGCAGG TTCTGAGGTT 750 

TCCACGTTGC TTGGTCGTAT GCCTTCTGCA GTTGGTTATC AGCCAAACTT 800 

50 GGCGGATGAA ATGGGTGCGT TGCAGGAGCG CATTACTTCT ACGCGCGGTC 850 

ATTCTATTAC GTCG 864 



55 2) INFORMATION FOR SEQ ID NO: 305 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 848 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

182 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(D) TOPOLOGY: Linear 



PCT/CA00/01150 



(ii) MOLECULE TYPE: Genomic DNA 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gemella haemolysans 

(B) STRAIN: ATCC 10379 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305 

LO 

TCGAATCAGG GCATATGCCA AATC TATTAA ACGCTTTAGA AGTTTACATA 50 

GAAAAAGGCG ATGGGAAAAA AGAAAAATTA GTTCTTGAAG TTTCTCTTGA 100 

AATTGGTGAT AACGTAGTAA GAACAATCGC TATGTCATCT ACTGATGGAT 150 

TAAATAGGGG AGCAGAAGTA GTAGATACAG GAGCACCAAT TACAGTTCCT 2 00 

L5 GTAGGTAACT ACACATTAGG TCGTGTGTTC AACGTATTAG GTGAAGCAGT 250 

TGACCACGGT GAAGAAGCAG GAGCAGAAGT TCGTAAAGAT TCAATTCACA 3 00 

AAGAAGCTCC AACATTCGAT GAATTATCAA CTCACGTTGA GGTTCTTGAA 350 

ACAGGTATTA AAGTTATCGA CTTACTTGCA CCATATATTA AAGGTGGTAA 400 

AATCGGTCTT TTCGGTGGTG CGGGAGTTGG TAAAACGGTT CTTATCCAAG 450 

20 AACTTATCAA CAACGTTGCG CAACAACACG GTGGATTATC AGTATTCACA 500 

GGTGTAGGTG AGCGTACTCG TGAAGGAAAT GACTTATACT ATGAAATGAA 550 

AGATTCTGGT GTTATTAACA AAACAGCCAT GGTATTCGGA CAAATGAACG 600 

AACCACCAGG TGCTCGTATG CGTGTAGCAT TAACAGGATT AACAATGGCG 650 

GAATACTTCC GTGATGAAGA AGGACAAGAC GTGCTTCTAT TCATCGATAA 700 

25 CATTTTCCGT TTCACACAAG CAGGTTCTGA GGTTTCTGCG TTATTAGGAC 750 

GTATGCCATC AGCCGTTGGT TACCAACCAA CACTTGCTAC AGAGATGGGA 800 

CGTTTACAAG AACGTATAAC ATCAACTAAA AAAGGTTCTG TTACATCT 848 



30 



40 



2) INFORMATION FOR SEQ ID NO: 306 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 84 8 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gemella morbillorum 

(B) STRAIN: ATCC 27824 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306 

TCGAATCAGG GCATATGCCT AATCTACTAA ACGCTTTAGA AGTTTATATA 50 

GAAAAAGGCG ATGGAAAAAA AGAAAAATTA GTTCTTGAAG TTTCTCTTGA 100 

AATCGGGGAT AATGTCGTAA GAACTATTGC GATGTCATCT ACTGATGGAT 150 

50 TAAACAGAGG GGCAGAAGTA GTTGATACTG GAGCGCCAAT TACAGTGCCA 200 

GTAGGTAACT ATACATTAGG ACGTGTGTTC AACGTATTAG GTGAAGCAGT 250 

TGACCACGGA GAAGAAGCTG GAGCAGAAGT TCAAAAAGAA TCTATTCATA 300 

AAGAAGCTCC AACTTTCGAA GAATTATCAA CACATGTTGA GGTATTAGAA 3 50 

ACAGGTATTA AAGTTATCGA CCTTCTTGCA CCATATATTA AAGGTGGTAA 400 

55 GATTGGACTA TTCGGTGGTG CTGGAGTTGG GAAAACAGTT CTTATCCAAG 450 

AACTTATTAA CAACGTAGCA CAACAACACG GAGGACTTTC AGTATTTACT 500 

GGGGTAGGTG AACGTACTCG TGAGGGTAAC GACTTGTACT ATGAAATGAA 550 

AGACTCTGGA GTTATTAATA AAACTGCCAT GGTATTTGGT CAAATGAATG 600 

AGCCACCAGG TGCACGTATG CGTGTTGCCT TAACAGGATT AACAATGGCA 650 

50 GAGTACTTCC GTGATGAAGA AGGACAAGAC GTACTATTAT TTATCGATAA 700 
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TATCTTCCGT TTCACACAAG CAGGGTCTGA 
GTATGCCTTC AGCCGTTGGA TATCAACCAA 
CGTCTTCAAG AACGTATTAC ATCAACTAAA 



GGTATCTGCA 
CTCTTGCAAC 
AAAGGATCTG 



TTATTAGGGC 
AGAAATGGGA 
TTACATCT 



750 
800 
848 



LO 



L5 



20 



25 



50 



35 



10 



15 



50 
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2) INFORMATION FOR SEQ ID NO: 307 

( i ) SEQUENCE CHARACTER I STI CS : 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

( i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus ducreyi 

(B) STRAIN: DSM 8925 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307 

GATGCAGTAC CAAAAGTATA TGATGCTTTA AAAGTTGAAT CAGGTTTAAC 50 

CTTAGAAGTT CAACAACAAT TAGGTGGTGG TTTAGTACGT TGTATCGCAT 100 

TAGGTACCTC AGATGGTTTA AAGCGTAGCT TAAAGGTTGT AAATACAGGT 150 

AACCCTATTC AAGTTCCTGT AGGCACTAAA ACATTAGGCC GTATTATGAA 200 

TGTATTAGGC GAACCAATTG ATGAAAAAGG ACCTATTAGC GAAGAAGCTC 250 

GTTGGGATAT TCATCGTGCG GCTCCAAATT ATGAAGAACA GTCAAATAGT 300 

ACTGAATTAC TTGAAACCGG TATCAAAGTT ATTGACTTAA TTTGTCCATT 350 

TGCAAAAGGT GGTAAAGTCG GCTTATTTGG TGGAGCTGGT GTAGGTAAAA 400 

CCGTTAATAT GATGGAATTG ATCCGTAATA TTGCTATTGA GCACTCAGGT 450 

TATTCGGTTT TTGCTGGTGT AGGTGAGCGT ACTCGTGAAG GTAATGATTT ■ 500 

TTATCATGAA ATGACGGATT CTAATGTATT AGATAAAGTA TCACTAGTAT 550 

ATGGTCAAAT GAATGAACCA CCAGGTAACC GCCTACGTGT TGCGTTAACA 600 

GGTTTAACTA TGGCTGAAAA ATTCCGTGAT GAAGGTCGTG ATGTATTATT 650 

TTTCGTAGAT AATATTTATC GTTATACTTT AGCCGGTACA GAAGTTTCTG 700 

CTTTATTAGG CCGTATGCCA TCAGCGGTAG GTTATCAACC AACCCTTGCA 750 

GAAGAAATGG GTGTATTACA AGAACGTATT ACCTCAACTA AAACTGGTTC 800 

AATCACGGCA GTA 813 



2) INFORMATION FOR SEQ ID NO : 3 08 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus haemolyticus 

(B) STRAIN: ATCC 333 90 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 08 

TGAATTTCCA CAAGATGCAG TGCCAAAAGT TTACGATGCA TTAAAAGTTG 50 
AATCAGGTTT AACACTTGAG GTGCAACAAC AATTAGGTGG CGGTGTGGTA 100 
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CGTTGTATCG CATTAGGTGC TTCTGACGGT TTAAAACGTG GTTTAAAAGT 150 

AGAAAACACG AATGATCCGA TTCAAGTACC GGTAGGCACA AAAACCCTTG 200 

GTCGTATCAT GAATGTATTG GGTGAACCAA TTGACGAACA AGGTCCAATC 250 

GGTGAAGAAG AGCGTTGGGC TATCCATCGT TCTGCACCAA GCTATGAAGA 300 

5 ACAATCAAAC AGTACGGAAT TATTAGAGAC TGGTATCAAA GTTATCGACT 350 

TAATTTGTCC ATTCGCAAAA GGTGGTAAAG TTGGTCTATT CGGTGGTGCG 4 00 

GGTGTAGGTA AAACCGTTAA CATGATGGAA TTAATCCGTA ACATCGCGAT 450 

CGAGCACTCA GGTTACTCCG TATTTGCGGG TGTAGGTGAA CGTACTCGTG 500 

AAGGTAACGA CTTCTATCAT GAAATGAAAG ATTCTAACGT ATTAGATAAA 550 

10 GTATCTTTGG TTTATGGTCA GATGAATGAG CCACCAGGTA ACCGTTTACG 600 

TGTTGCGTTA ACTGGTTTAA CCATGGCAGA AAAATTCCGC GATGAAGGTC 650 

GTGATGTATT ATTCTTCGTG GATAATATCT ATCGTTATAC CCTTGCTGGT 700 

ACGGAAGTAT CTGCGTTATT AGGTCGTATG CCATCTGCGG TAGGTTACCA 750 

ACCAACTCTT GCTGAAGAAA TGGGTGTGTT ACAAGAACGT ATCACTTCAA 800 

L5 CCAAAACAGG TTCTATTACA TCTGTA 82 6 



2) INFORMATION FOR SEQ ID NO: 309 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 809 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Haemophilus parahaemolyticus 

(B) STRAIN-: ATCC 10014 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309. 

35 GATGCAGTAC CAAAAGTATA TGATGCGTTA AAAGTTGAAT CAGGTTTAAC 50 

GCTTGAAGTT CAACAACAAT TAGGCGGTGG CTTAGTGCGC TGTATCGCAT 100 

TAGGTACGTC TGATGGTTTA AAACGTGGCT TAAAAGTAGA AAATACAGGC 150 

AACCCAATTG AAGTGCCAGT GGGCACTAAA ACCCTTGGTC GTATTATGAA 200 

CGTATTGGGT GAGCCGATTG ACGAAAAAGG TCCTATCGGT GAAGAAGCAC 250 

4 0 GCTGGGCAAT CCACCGTGCA GCACCAAGCT ACGAAGAGCA ATCAAATAGC 300 

ACGGAATTAC TCGAAACAGG TATCAAAGTT ATCGACTTAA TCTGCCCATT 350 

CGCAAAAGGG GGTAAAGTTG GTTTATTTGG TGGTGCAGGT GTAGGTAAAA 400 

CCGTAAATAT GATGGAGTTA ATC CGTAACA TCGCGATCGA ACACTCTGGT 450 

TACTCTGTAT TTGCAGGGGT AGGTGAGCGT ACTCGTGAAG GTAATGACTT 500 

45 CTACCACGAA ATGACAGACT CTAACGTATT AGATAAAGTA TCGTTAGTGT 550 

ATGGTCAAAT GAACGAACCA CCAGGTAACC GTTTACGCGT AGCTTTAACA 600 

GGCTTAACCA TGGCGGAAAA ATTCCGCGAT GAAGGTCGTG ACGTATTATT 650 

CTTCGTCGAT AACATCTACC GTTATACCCT AGCAGGTACG GAAGTGTCAG 700 

CACTTCTCGG TCGTATGCCA TCTGCGGTAG GTTATCAGCC AACCTTAGCA 750 

50 GAAGAAATGG GTGTATTACA AGAGCGTATC ACTTCAACCA AAACTGGTTC 800 

TATCACCTC 809 



55 2) INFORMATION FOR SEQ ID NO: 310 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 



PCT/CA00/01150 



10 



30 



40 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus parainfluenzas 

(B) STRAIN: ATCC 7901 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310 



CGAATTTCCA CAAGATGCAG TACCAAAAGT TTATGATGCA TTAAAAGTTG 50 

AATCGGGTTT AACCCTTGAA GTTCAACAAC AATTAGGTGG TGGTGTGGTA 100 

CGTTGTATCG CACTGGGAGC TTCTGACGGT TTAAAACGCA GTTTAAGCGT 150 

TGAAAATACC AATAAACCAA TTTCAGTACC GGTTGGTGTA AAAACTCTCG 200 

15 GTCGTATTAT GAACGTATTG GGCGAACCGA TTGATGAAAG AGGTCCTATC 250 

GGTGCGGAAG AAGAATGGGC AATTCACCGT TCTACTCCAA GTTATGAAGA 3 00 

ACAGTCCAAC AGTACCGAAT TATTAGAAAC CGGTATCAAA GTTATCGACT 350 

TAATTTGTCC ATTCGCGAAG GGTGGTAAAG TTGGTTTATT CGGTGGTGCG 400 

GGTGTAGGTA AGACCGTAAA TATGATGGAA TTAATCCGTA ATATTGCGAT 450 

2 0 TGAGCACTCA GGTTACTCCG TATTTGCCGG TGTAGGTGAG CGTACCCGTG 500 

AAGGTAACGA CTTCTACCAT GAAATGACAG AATCTAACGT ATTAGACAAA 550 

GTATCCCTAG TTTACGGACA AATGAATGAG CCGCCGGGTA ACCGTTTACG 600 

TGTTGCTTTA ACCGGTTTAA CCATGGCAGA AAAATTCCGT GACGAAGGTC 650 

GTGATGTATT ATTCTTCGTG GATAACATCT ATCGTTATAC CCTTGCAGGG 700 

25 ACTGAAGTAT CGGCACTTTT AGGCCGTATG CCATCAGCGG TAGGTTATCA 750 

GCCGACACTT GCAGAAGAAA TGGGTGTGTT ACAAGAACGT ATTACATCAA 800 

CCAAAACAGG TTCTATTACT TCTG 824 



2) INFORMATION FOR SEQ ID NO: 311 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 811 bases 
35 (B) TYPE": Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hafnia alvei 

(B) STRAIN: ATCC 13337 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311 

GCCGTGCCTA AAGTGTATAA CGCACTTGAG GTGAAAGGCG GTGCCACTAA 50 

ACTGGTACTG GAAGTTCAGC AGCAGCTAGG CGGCGGCGTT GTACGCTGTA 100 

TCGCTATGGG TACTTCTGAC GGTCTGCGTC GCGGACTGGA CGTTGTTGAC 150 

50 CTGGAGCACC CGATTGAAGT CCCAGTAGGT AAAGCGACCT TAGGCCGCAT 200 

TATGAACGTA CTGGGTGAGC CAATTGATAT GAAGGGTGAT ATCGGCGAAG 250 

AAGATCGCTG GGCTATTCAC CGTGAAGCTC CAAGCTACGA AGAACTGTCT 300 

AACTCGCAAG AACTGCTGGA AACTGGTATC AAGGTAATGG ACCTGATTTG 350 

TCCGTTCGCT AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTTG 400 

55 GTAAAACAGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450 

TCAGGTTACT CTGTATTTGC CGGCGTGGGT GAACGTACTC GTGAGGGTAA 500 
CGACTTCTAC CACGAAATGA CCGACTCCAA CGTATTGGAC AAAGTATCAC • 550 

TGGTTTATGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGTC TGACTATGGC TGAGAAGTTC CGTGACGAAG GTCGTGACGT 650 

60 ACTGCTGTTC AT CGATAACA TCTACCGTTA TACCTTGGCC GGTACCGAAG 700 
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TATCTGCACT GTTGGGTCGT ATGCCTTCTG CGGTAGGTTA TCAGCCAACG 750 
CTGGCGGAAG AGATGGGTGT TCTGCAAGAA CGTATCACCT CGACCAAAAC 800 
GGGTTCAATC A 811 



2) INFORMATION FOR SEQ ID NO: 312 

(i) SEQUENCE CHARACTERISTICS: 
LO (A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kingella kingae 

(B) STRAIN: ATCC 23330 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312 



GCGATGCTAT TCCACGCGTT TACGATGCGT TGAAACTGGT TGATGTGGAC 50 

TTGACATTAG AAGTGCAACA ACAACTGGGC GATGGCGTAG TGCGTACCAT 100 

25 TGCGATGGGT AGTACCGATG GTTTGAAACG CGGCTTAGCC GTGAACAACA 150 

CAGGCGCACC TATTACAGTG CCTGTTGGTA AAGCAACATT GGGTCGTATT 200 

ATGGACGTAT TGGGTAATCC TGTTGATGAA GCAGGTCCAA TTGGTTCTGA 250 

CCAAACGCGT GCTATTCACC AACCAGCTCC TAAATTTGAT GAACTGTCTA 300 

GCGCAACCGA ATTGCTGGAA ACAGGCATCA AAGTGATTGA CTTGCTTTGC 350 

10 CCATTTGCAA AAGGTGGTAA AGTAGGTTTG TTTGGTGGTG CAGGTGTGGG 400 

CAAAACTGTG AACATGATGG : AGTTGATTAA CAACATTGCC AAAGCGCACA 450 

GTGGTTTGTC TGTATTTGCA GGCGTGGGTG AACGTACTCG CGAAGGTAAT 500 

GACTTCTATC ACGAGATGAA AGATTCTAAC GTGTTGGATA AAGTTGCCAT 550 

GGTGTATGGT CAAATGAATG AACCTCCTGG CAACCGTTTG CGCGTTGCAT 600 

35 TGACTGGTTT GTCTATGGCA GAACACTTCC GTGATGAAAA AGACGAAAAT 650 

GGCAAAGGTC GCGATGTATT GTTCTTTGTG GACAACATCT ATCGCTACAC 700 

ATTGGCAGGT ACAGAAGTAT CGGCATTGCT GGGTCGTATG CCCTCTGCGG 750 

TAGGTTATCA ACCAACATTG GCAGAAGAAA TGGGTCGTTT GCAAGAGCGT 800 

ATTACTTCAA CGCAAACAGG TTCGATTACT T 831 

10 



2) INFORMATION FOR SEQ ID NO: 313 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. ozaenae 
55 (B) STRAIN: ATCC 11296 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313 

ATGCCGTACC ACGCGTGTAC GAAGCCCTTG AGGTACAGAA TGGTAATGAA 50 
50 GTTCTGGTGC TGGAAGTTCA GCAGCAGCTG GGCGGCGGTA TCGTACGTAC 100 
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CATCGCCATG GGTTCTTCTG ATGGTCTGCG CCGCGGTCTG GATGTAAAAG 150 

ACCTCGAGCA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC GCTGGGTCGT 2 00 

ATCATGAACG TACTGGGTCA ACCGGTTGAC ATGAAAGGCG ACATCGGCGA 250 

AGAAGAGCGT TGGGCTATCC ACCGCGCGGC ACCGTCCTAT GAAGAGCTGT 300 

5 CCAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT CGACCTGATG 350 

TGTCCGTTCG CCAAGGGCGG TAAAGTTGGT CTGTTCGGCG GTGCGGGTGT 4 00 

AGGTAAAACT GTAAACATGA TGGAGCTGAT CCGTAACATC GCGATCGAGC 450 

ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAGCGTAC TCGTGAGGGT 500 

AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG ATAAAGTATC 550 

LO CCTGGTGTAC GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600 

CGCTGACCGG CCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

GTACTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG CCGGTACTGA 700 

AGTATCCGCG CTGCTGGGTC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 750 

CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC CTCCACCAAA 800 

L5 ACCGGTTCTA TC 812 



2) INFORMATION FOR SEQ ID NO: 314 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Klebsiella ornithinolytica 

(B) STRAIN: 1 ATCC 31898 

* (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314 

35 ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTACAGAA TGGTAATGAG 50 

AGCCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGTGGTA TCGTACGTGC 100 

TATCGCCATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG GAAGTTAAAG 150 

ACCTTGAGCA CCCGATCGAA GTCCCGGTTG GTAAAGCAAC GCTGGGTCGT 200 

ATCATGAACG TGCTGGGTCA GCCAATCGAT ATGAAAGGCG ACATCGGCGA 250 

10 AGAAGAGCGT TGGGCTATTC ACCGTGCAGC TCCGTCCTAT GAAGAGCTGT 3 00 

CCAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT CGACCTGATG 350 

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCGGGTGT 400 

AGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC GCGATCGAGC 4 50 

ACTCCGGTTA CTCCGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT 500 

15 AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTCTGG ATAAAGTATC 550 

CCTGGTTTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600 

CTCTGACCGG CCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

GTTCTGCTGT TCGTCGATAA CATCTATCGT TATACCCTGG CCGGTACTGA 700 

AGTATCCGCA CTGCTGGGTC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 750 

50 CCCTGGCGGA AGAGATGGGT GTTCTGCAGG AACGTATCAC CTCCACCAAA 800 

ACCGGTTCTA TC 812 



55 2) INFORMATION FOR SEQ ID NO: 315 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 



PCT/CA00/01150 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 33496 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315 



LO 



GTACCGCGCG TGTACGAGGC TCTTGAGGTA CAAAATGGTA GTGAGAATCT 50 

GGTGCTGGAA GTTCAGCAGC AGCTCGGCGG CGGTATTGTT CGTACCATCG 100 

CCATGGGTTC TTCCGACGGT CTGCGTCGCG GTCTGGAAGT CAAAGACCTC 150 

GAGCATCCGA TCGAAGTCCC GGTAGGTAAA GCAACGCTGG GTCGTATCAT 2 00 

L5 GAACGTACTG GGCCAACCGG TAGACATGAA AGGCGACATC GGCGAAGAAG 2 50 

AGCGTTGGGC GATTCACCGC GCAGCGCCTT CCTACGAAGA GTTGTCAAAC 3 00 

TCTCAGGAAC TGCTGGAAAC CGGCATCAAA GTTATCGACC TGATGTGTCC 350 

GTTTGCGAAG GGCGGTAAAG TTGGTCTGTT CGGTGGTGCG GGTGTAGGTA 400 

AAACCGTAAA CATGATGGAG CTGATCCGTA ACATCGCGAT CGAGCACTCC 450 

20 GGTTACTCCG TGTTTGCGGG CGTAGGTGAA CGTACTCGTG AGGGTAACGA 500 

CTTCTACCAC GAAATGACCG ACTCCAACGT TAT C GAT AAA GTATCCCTGG 550 

TGTATGGCCA GATGAACGAG CCGCCGGGAA ACCGTCTGCG CGTTGCGCTG 600 

ACCGGCCTGA CCATGGCTGA GAAGTTCCGT GACGAAGGTC GTGACGTTCT 650 

GCTGTTCGTC G AT AAC AT CT ATCGTTACAC CCTGGCCGGT ACTGAAGTAT 700 

25 CCGCACTGCT GGGTCGTATG CCTTCAGCGG TAGGTTACCA GCCGACTCTG 750 

GCGGAAGAGA TGGGCGTTCT GCAGGAACGT ATCACCTCCA CCAAAACGGG 800 

TTCTATCACT TCC 813 



30 



2 ) INFORMATION FOR SEQ ID NO: 316 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 822 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



10 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella planticola 

(B) STRAIN: ATCC 33531 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316 

GATGCCGTAC CGCGCGTGTA CGATGCTCTT GAGGTACAGA ATGGTAATGA 50 

GAGCCTGGTG CTGGAAGTTC AGCAGCAGCT CGGCGGTGGT ATCGTACGTG 100 

CTATCGCCAT GGGTTCTTCT GACGGTCTGC GTCGTGGTCT GGAAGTTAAA 150 

50 GACCTTGAGC ACCCGATCGA AGTCCCGGTT GGTAAAGCAA CGCTGGGTCG 200 

TATCATGAAC GTGCTGGGTC AGCCGATCGA TATGAAAGGC GACATCGGCG 250 

AAGAAGAGCG TTGGGCTATT CACCGCGCAG CTCCGTCTTA TGAAGAGCTG 300 

TCCAGTTCTC AGGAACTGCT GGAAACCGGC ATCAAAGTTA TCGACCTGAT 350 

GTGTCCGTTC GCTAAGGGCG GTAAAGTAGG TCTGTTCGGT GGTGCGGGCG 400 

55 TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ACTCCGTGTT TGCGGGCGTC GGTGAACGTA CTCGTGAGGG 500 

TAACGACTTC T AC C ACGAAA TGACCGACTC CAACGTTCTG GATAAAGTAT 550 

CCCTGGTTTA TGGCCAGATG AACGAGCCGC CGGGAAACCG TCTGCGCGTT 600 

GCTCTGACCG GCCTGACCAT GGCTGAGAAA TTCCGTGACG AAGGTCGTGA 650 

50 CGTTCTGCTG TTCGTCGATA ACATCTATCG TTATACCCTG GCCGGTACTG 700 
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AAGTATCCGC ACTGCTGGGT CGTATGCCTT 
ACCCTGGCGG AAGAGATGGG TGTTCTGCAG 
AACCGGTTCT ATCACTTCCG TA 



PCT/CA00/01150 

CAGCGGTAGG TTATCAGCCG 750 
GAACGTATCA CCTCCACCAA 800 

822 



2) INFORMATION FOR SEQ ID NO: 317 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 785 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 13883 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317 

AGAATGGTAA TGAAGTTCTG GTGCTGGAAG TTCAGCAGCA GCTGGGCGGC 50 

GGTATCGTAC GTACCATCGC CATGGGTTCT TCTGATGGTC TGCGCCGCGG 100 

TCTGGATGTA AAAGACCTCG AGCACCCGAT CGAAGTCCCG GTAGGTAAAG 150 

CAACGCTGGG TCGTATCATG AACGTACTGG GTCAACCGGT TGACATGAAA 200 

GGCGACATCG GCGAAGAAGA GCGTTGGGCT ATCCACCGCG CGGCACCGTC 250 

CTATGAAGAG CTGTCCAGCT CTCAGGAACT GCTGGAAACC GGCATCAAAG 300 

TTATCGACCT GATGTGTCCG TTCGCCAAGG GCGGTAAAGT TGGTCTGTTC 350 

GGCGGTGCGG GTGTAGGTAA AACTGTAAAC ATGATGGAGC TGATCCGTAA 400 

CATCGCGATC GAGCACTCCG GTTACTCTGT GTTTGCGGGC GTAGGTGAGC 450 

GTACT C GTGA GGGTAATGAC TTCTACCACG AAATGACCGA CTCCAACGTT 500 

ATCGATAAAG TATCCCTGGT GTACGGCCAG ATGAACGAGC CGCCGGGAAA 550 

CCGTCTGCGC GTTGCGCTGA CCGGCCTGAC CATGGCTGAG AAATTCCGTG 600 

ACGAAGGTCG TGACGTACTG CTGTTCGTCG ATAACATCTA TCGTTACACC 650 

CTGGCCGGTA CTGAAGTATC CGCACTGCTG GGTCGTATGC CTTCAGCGGT 700 

AGGTTATCAG CCGACCCTGG CGGAAGAGAT GGGCGTTCTG CAGGAACGTA 750 

TCACCTCCAC CAAAACCGGT TCTATCACCT CCGTA 785 



2) INFORMATION FOR SEQ ID NO: 318 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 759 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) , TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera ascorbata 

(B) STRAIN: ATCC 33433 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318 

CTGGTGCTGG AAGTTCAGCA GCAGCTCGGC GGCGGTATCG TACGTWCCAT 50 
CGCTATGGGT TCTTCCGACG GTCTGCGTCG CGGTCTGGAT GTTAAAGATC 100 
TCGAGCACCC AATCGAAGTT CCGGTMGGTA AAGCAACMCT GGGTCGTATC 150 
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ATGAACGTAC TGGGTCAKCC AGTMGACATG AAAGGCGACA TCGGTGAaSa ~200" 

AGAGCGTTGG GCTATCCACC GCGCTGCACC TTCCTACGAA GAGCTGTCTA 250 

GCTCTCAGGA ATTGCTGGAA ACCGGTATCA AAGTTATCGA CCTGATGTGT 300 

CCGTTCGCTA AGGGCGGTAA AGTCGGTCTG TTCGGTGGTG CSGGTGTTGG 350 

5 TAAAACCGTA AACATGATGG AGCTGATCCG TAACATCGCG ATCGAGCACT 400 

CCGGTTACTC CGTGTTTGCG GGCGTAGGTG AACGTACTCG TGAGGGTAAC 450 

GACTTCTACC ACGAAATGAC CGACTCCAAC GTTATCGATA AAGTATCCCT 500 

GGTATATGGC CAGATGAACG AGCCACCGGG AAACCGTCTG CGCGTTGCTC 550 

TGACCGGTCT GACCATGGCT GAGAAATTCC GTGACGAAGG TCGTGACGTA 600 

LO CTGCTGTTCG TCGATAACAT CTATCGTTAC ACCCTGGCCG GTAC TGAAGT 650 

ATCTGCWCTG CTGGGTCGTA TGCCTTCAGC GGTAGGTTAC CAGCCGACCC 700 

TGGCGGAAGA GATGGGCGTT CTGCAGGAAC GTATCACCTC CACCAAGACC 750 

GGTTCTATC 759 



L5 



2) INFORMATION FOR SEQ ID NO: 319 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera cryocrescens 

(B) STRAIN: ATCC 33435 

(xi )> SEQUENCE DESCRIPTION: SEQ ID NO: 319 



10 



TTCCCTCAGG ATGCCGTACC GCGTGTGTAC GAAGCCCTTG AGGTTCAGAA 50 

TGGTAATGAA GTGCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA 100 

15 TCGTACGTAC CATCGCTATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG 150 

GATGTAAAAG ACCTCGAGCA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC 200 

ACTGGGTCGT ATCATGAACG TACTGGGCCA ACCGGTAGAC ATGAAAGGCG 250 

ACATCGGTGA AGAAGAACGT TGGGCTATCC ACCGTGCAGC ACCTTCCTAC 300 

GAAGAGCTGT CAAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT 350 

10 CGACCTGATG TGTCCGTTTG CGAAGGGCGG TAAAGTTGGT CTGTTCGGTG 400 

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT TCGTAACATC 450 

GCGATTGAGC ACTCCGGTTA TTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ATAAAGTTTC CCTGGTTTAC GGCCAGATGA ACGAGCCACC AGGAAACCGT 600 

15 CTGCGCGTTG CGCTGACTGG TCTGACTATG GCTGAGAAGT TCCGTGACGA 650 

AGGTCGCGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG 700 

CCGGTACAGA AGTATCTGCA CTGCTGGGTC GTATGCCTTC AGCGGTAGGT 750 

TACCAGCCGA CTCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACCAAA ACCGGTTCTA TCACCTCCGT A 831 

50 



2) INFORMATION FOR SEQ ID NO: 320 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

191 
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(ii) MOLECULE TYPE: Genomic DNA 



PCT/CAOO/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera georgiana 
5 (B) STRAIN: ATCC 51603 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320 

GCCGTACCGC GCGTGTACGA AGCCCTTGAG GTACAGAATG GTAATGAAGT 50 

L0 GCTGGTGCTG GAAGTTCAGC AGCAGCTCGG TGGCGGTATC GTGCGTACCA 100 

TCGCCATGGG TTCCTCCGAC GGTCTGCGTC GCGGTCTGGA AGTTAAAGAT 150 

CTCGAGCACC CGATCGAAGT TCCGGTAGGT AAAGCAACAC TGGGTCGTAT 2 00 

CATGAACGTA CTGGGTCACC CGGTAGACAT GAAAGGCGAC ATCGGTGAAG 250 

AAGAGCGTTG GGCTATCCAC CGCGCTGCGC CTTCCTACGA AGAGCTGTCC 3 00 

L5 AGCTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGCGGT GCGGGTGTTG 4 00 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 4 50 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAT AAAGTATCCC 550 

20 TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGCC TGACCATGGC TGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACTGAAG 700 

TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA CCAGCCGACT 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAGAC 800 

2 5 CGGTTCTATC 810 



2) INFORMATION FOR SEQ ID NO: 321 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Lactobacillus acidophilus 

(B) STRAIN: ATCC 4356 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 321 

15 TCGATAAGAA TTTACCTGAT ATTAACAACG CCTTACGTGT AATCAAGTCC 50 

GAAGATGAAA GCATCGTTCT TGAAGTTACA CTTGAACfCG GTGATGGTGT 100 

TTTAAGAACA ATCGCCATGG AATCTACCGA TGGTCTTCGT CGTGGTATGA 150 

AAGTCGAAGA TACTGGCGCT CCAATTTCAG TTCCAGTTGG AGAAGACACT 2 00 

TTAGGTCGTG TGTTTAACGT TTTAGGACAG CCTATTGATG GTGGTCCAGC 250 

50 CTTTCCAAAG GATCACCCAC GTGAGGGTAT CCACAAGGAA GCACCTAAAT 300 

ATGAAGATTT AACTACTAGT CGTGAAATTC TTGAAACTGG TATCAAGGTT 350 

ATCGACCTTC TTGAACCATA TGTTCGTGGT GGTAAAGTTG GTTTGTTTGG 400 

TGGTGCCGGT GTTGGTAAAA CTACTATTAT TCAAGAATTA ATTCACAACA 450 

TCGCTCAAGA ACACGGTGGT ATTTCCGTAT TTACTGGTGT TGGTGAAAGA 500 

55 ACTCGTGAAG GTAATGACCT TTACTTTGAA ATGAAAGCTT CAGGCGTTTT 550 

AAGTAAGACT GCCATGGTAT TTGGTCAGAT GAACGAGCCG CCTGGTGCCA 600 

GAATGCGTGT TGCATTAACC GGTTTGACAC TTGCTGAATA CTTTAGAGAT 650 

GTTGAAGGTC AAGACGTATT GCTCTTTATT GACAATATCT TTAGATTTAC 700 

TCAGGCTGGT TCAGAGGTAT CTGCTTTGCT TGGTCGTATG CCAAGTGCCG 750 

SO TAGGTTATCA GCCAACTTTG GCAACAGAAA TGGGTCAATT GCAGGAAAGA 800 

192 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

ATTACTTCTA CTAAGAAGGG TTCAATTACT TCAA 



PCT/CA00/01150 



5 2 ) INFORMATION FOR SEQ ID NO: 322 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Legionella pneumophila subsp. pneumophila 

(B) STRAIN: ATCC 33152 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322 



TTCCTCGTGA TAGCGTGCCT AAAGTCAATG ATGC GTTAAA GCTTGTTGAT 50 

AGTGATCTGG TTTTTGAAGT GCAGCAGCAA CTTGGAGACG GAGTTGTGCG 100 

TACTATTGCC ATGGGAACAA CCGATGGTTT AAAGCGAGGA TTAAAAGCAG 150 

AAAATACAGG CCATCCTATT CAAGTGCCAG TAGGTAAGAA AACTTTGGGA 200 

25 CGCATTATGG ATGTTCTTGG GCGTCCTGTA GATGATGCTG GGCCTATCGA 250 

TGCTGAAGAG ACTTGGGCTA TTCATCGTAA AGCACCAAGT TATGAAGAGC 3 00 

AAGCTGGCAG CCAGGAATTA TTGGAAACTG GTATTAAAGT AATTGATTTG 3 50 

CTTTGCCCTT TTGCCAAGGG AGGTAAAGTT GGTCTATTCG GTGGTGCCGG 4 00 

TGTAGGCAAA ACCGTTAACA TGATGGAATT AATACGAAAC ATTGCAATTG 450 

3 0 AGCATAGCGG TTATTCAGTG TTTGCAGGGG TTGGTGAACG TACCCGTGAA 500 

GGAAACGACT TCTATCATGA GATGAAAGAC TCTAATGTAT TGGATAAAGT 550 

ATCGCTTGTT . TATGGTCAGA TGAATGAGCC GCCAGGAAAC CGTTTGCGTG 600 

TTGCTCTAAC CGGTTTGACT ATGGCTGAAA AATTCCGGGA TGAAGGGCGA 650 

GACGTTCTTT TGTTTATCGA TAATATTTAT CGTTATACCT TGGCTGGGGT 700 

35 TGAAGTATCT GCGCTGTTAG GCCGTATGCC TTCTGCAGTA GGATATCAGC 750 

CGACATTAGC AGAGGAAATG GGTATGCTGC AAGAGCGCAT TACCTCCACA 800 

AAAACAGGTT CTATTACTTC CATA 824 



40 



50 



2) INFORMATION FOR SEQ ID NO: 323 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 818 bases 
4 5 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leminorella grimontii 

(B) STRAIN: ATCC 33999 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323 

GACGCCGTAC CGAAAGTGTA CGATGCGCTT GAAGTTCAAA TTGATGCCAA 50 

GCTGGTTCTG GAAGTTCAAC AGCAGCTCGG CGGCGGCGTT GTTCGCTGCA 100 

TCGCGATGGG TACTTCAGAC GGCTTAAGCC GCGGTCTGGA CGTGCTCGAT 150 

60. CTGGAACACC CGATTGAAGT ACCGGTGGGC AAAGCGACGC TGGGCCGCAT 200 
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CATGAACGTG CTTGGTCACC CTATCGACAT GAAGGGCGAC ATCGGCGAAG ' ' 2'Bff 

AAGAGCGTTG GGCTATTCAC CGCGCAGCGC CGAGCTACGA AGACCTGTCG 3 00 

GGCGCAACCG AGCTGCTGGA GACCGGCATC AAGGTTATCG ACCTGATTTG 350 

TCCGTTCGCC AAGGGCGGTA AAGTCGGCCT GTTCGGCGGC GCCGGCGTAG 4 00 

5 GTAAAACCGT AAACATGATG GAGCTCATTC GCAACATTGC GACCGAGCAC 4 50 

TCCGGTTACT CCGTGTTTGC AGGCGTAGGT GAACGTACCC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CTGAATCCAA CGTATTGGAC AAGGTGTCGC 550 

TGGTATACGG TCAGATGAAC GAGCCGCCTG GAAACCGTCT GCGCGTAGCG 600 

TTAACGGGCT TGACCATGGC GGAGAAGTTC CGTGATGAAG GCCGTGACGT 650 

10 TCTGCTGTTT ATCGACAACA TTTACCGCTA TACCCTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGCCGT ATGCCTTCAG CCGTAGGCTA CCAGCCGACT 750 

CTGGCTGAGG AAATGGGCGT GCTTCAAGAG CGTATTACCT CTACCAAGAC 800 

GGGGTCTATC ACCTCCGT 818 



15 



2) INFORMATION FOR SEQ ID NO: 324 



(i) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM:: Listeria monocytogenes 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324 

GTGGAAACTT ACCTGAAATC TACAATGCCC TAGTTATTGA ATATAAATCT 50 

GATGCAGAAG AAGCACCAAC TAGCCAACTT ACTTTAGAAG TAGCCATCCA 100 

ATTAGGTGAT GATGTTGTAC GTACAATCGC AATGGCATCA ACAGATGGTG 150 

35 TTCAAAGAGG TATGGAAGTT ATTGATACTG GGAGCCCAAT TACAGTTCCT 200 

GTAGGTACAG TAACTCTTGG TCGTGTATTT AATGTATTAG GAAACACCAT 250 

CGATTTGGAC GAACCACTTC CAAGCGATAT TAAACGTAAT AAAATTCACC 300 

GCGAAGCTCC AACTTTCGAT CAATTAGCAA CGACAACAGA AATTCTTGAA 350 

ACAGGAATTA AAGTAGTTGA CTTGCTAGCT CCTTACTTAA AAGGTGGTAA 400 

40 AATCGGATTG TTCGGTGGTG CCGGTGTTGG TAAAACCGTT CTAATTCAAG 450 

AGCTTATCCA TAATATCGCA CAAGAACATG GTGGTATTTC TGTGTTCGCT 500 

GGCGTTGGAG AACGTACTCG TGAAGGTAAC GACCTTTACT TCGAAATGAA 550 

AGATTCAGGC GTTATTGAAA AAACAGCGAT GGTATTCGGT CAAATGAACG 600 

AGCCACCAGG TGCGCGTATG CGTGTTGCCT TAACTGGTCT AACAATTGCT 650 

45 GAATATTTCC GTGATGAAGA ACATCAAGAT GTACTTTTAT TCATTGATAA . 700 

CATTTTCCGT TTCACACAAG CTGGTTCAGA GGTTTCGGCT TTACTAGGTC 750 

GTATGCCATC TGCGGTAGGT TACCAACCAA CCCTAGCTAC TGAAATGGGT 800 

CAACTACAAG AACGTATTAC ATCTACTAAC GTTGG 835 

50 

2) INFORMATION FOR SEQ ID NO: 325 

(i) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 82 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Micrococcus lylae 

(B) STRAIN: ATCC 27566 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325 

CCCGCGTGGC GAGTTGCCGG CACTGTTCAA CGCGCTGACT GTCGAGGTCA 50 

CCCTCGAAGC AGTCGCTAAA ACCATTACCC TTGAGGTTGC TCAGCACCTC 100 

10 GGCGACAACT TGGTTCGTGC CGTGTCCATG GCACCGACCG ACGGTCTCGT 150 

CCGTGGCGCT GCTGTGATCG ACAGCGGTAA GCCGATCTCA GTTCCCGTTG 200 

GTGACGTAGT CAAGGGACAC GTCTTCAACG CTCTGGGTGA TTGCCTCGAT 250 

GAGCCAGGTC TTGGCCGTGA CGGTGAGCAG TGGGGCATCC ACCGCGATCC 300 

GCCACCTTTT GACCAGCTTG AGGGTAAGAC CGAGATTCTG GAAACCGGTA 350 

15 TTAAGGTCAT CGACCTGCTG ACCCCGTATG TTAAGGGCGG CAAGATCGGC 400 

CTGTTCGGTG GTGCTGGTGT GGGTAAGACC GTTCTTATCC AGGAAATGAT 450 

CACCCGTATC GCTCGCGAGT TCTCCGGTAC CTCGGTGTTC GCAGGCGTGG 500 
GTGAGCGTAC CCGTGAGGGC ACCGACCTCT TCCTGGAAAT GGAAGAGATG , 550 

GGCGTTCTCC AGGACACCGC TCTTGTGTTC GGCCAGATGG ACGAGCCTCC 600 

20 AGGAGTTCGT ATGCGCGTGG CGCTGTCCGG CCTGACCATG GCGGAGTACT 650 

TCCGCGATGT GCAGCACCAG GACGTGCTTC TGTTCATCGA CAACATCTTC 700 

CGTTTCACCC AGGCAGGTTC CGAGGTTTCC ACCCTCCTAG GCCGCATGCC 750 

TTCTGCCGTG GGTTACCAGC CAACGCTGGC AGACGAGATG GGTGTTCTGC 800 

AGGAGCGTAT TACCTCCACA AAGGGTAA 828 

25 



2 ) INFORMATION FOR SEQ ID NO: 326 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) . TYPE: Nucleic acid 
(OK STRANDEDNESS : Double 
(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Moellerella wisconsensis 
10 (B) STRAIN: ATCC 35017 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326 

GATGCCGTAC CAAAAGTGTA CGATGCTCTT GAGGTTCTTA ACGGTAAAGA 50 

15 AAAATTGGTG CTGGAAGTTC AGCAACAATT AGGCGGTGGT GTTGTTCGTT 100 

GTATCGCAAT GGGTACATCA GATGGTTTAA GCCGCGGTTT AGAAGTTAAA 150 

AATACAGATC ATCCGATCGA AGTTCCTGTC GGTGTTAAAA CGCTTGGCCG 200 

TATCATGAAC GTGCTGGGTG ACCCAATCGA CATGAAAGGT GATATCGGCG 250 

AAGAAGAACG CTGGTCAATT CACCGCGCAG CACCAAGCTA TGAAGATCTG 300 

50 GCTAACTCAA CAGAACTTCT AGAAACAGGT ATCAAAGTTA TGGACCTGAT 350 

TTGCCCATTC GCTAAAGGGG GTAAAGTGGG TCTGTTCGGT GGTGCGGGTG 400 

TCGGTAAAAC AGTTAACATG ATGGAGCTTA TTCGTAATAT CGCGATTGAG 450 

CACTCAGGTT ATTCTGTATT CGCGGGTGTT GGTGAACGTA CTCGTGAAGG 500 

TAACGATTTC TACCATGAAA TGACAGACTC AAACGTTCTG GATAAAGTTT 550 

55 CATTGGTTTA TGGCCAGATG AATGAGCCAC CAGGAAACCG TCTGCGTGTT 600 

GCTCTGACTG GTCTGACTAT GGCAGAGAAA TTCCGTGACG AAGGTCGTGA 650 

CGTACTGTTA TTCGTAGATA ATATTTATCG TTATACCTTA GCAGGGACAG 700 

AAGTATCTGC ACTGCTGGGT CGTATGCCTT CAGCGGTGGG TTATCAGCCA 750 

ACGCTGGCGG AAGAGATGGG TGTTCTGCAA GAACGTATCA CCTCGACTAA 800 

50 GACCGGCTCT ATCACTTCCG TA 822 
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2 ) INFORMATION FOR SEQ ID NO: 327 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 854 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Branhamella catarrhalis 

(B) STRAIN: ATCC 43628 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327 



CCGTGGCGAT GTCCCCCAAA TCTTTGATGC 
AAACCACCCT TGAAGTCCAA CAACAGTTAG 
ATTGCCATGG GTTCTACCGA AGGCTTAAAG 
TTCAGGTGCA CCCATTTCGG TACCAGTCGG 
TTATGGATGT CCTAGGTCGC CCAATCGATG 
GAACAAAAAT GGTCCATTCA TCGTGAAGCA 
AAATAGTACA GAACTTTTAG AAACAGGCAT 
GTCCATTTGC CAAAGGTGGT AAAGTCGGTC 
GGTAAGACCG TTAACATGAT GGAGCTTATC 
CTCAGGTCTG TCGGTTTTTG CTGGTGTGGG 
ATGACTTCTA CCATGAAATG CAAGAAGCAG 
TTTACTCAGT CAAAAGTTGC CATGGTTTAT 
AGGAAACCGT CTGCGTGTTG CCTTAACTGG 
TCCGTGATGA AAAAGACGAA GCAACGGGCA 
TTCGTTGATA ATAT TTATCG TTACACATTG 
ACTTTTAGGT CGTATGCCAT CTGCGGTAGG 
AAGAGATGGG CTTGCTACAA GAGCGTATCA 
ATTA 



ACTTCATGTT GATGGTACTG 50 

GTGATGGTGT GGTGCGTACC 100 

CGTGGCTTGC CTGTCTCTAA 150 
TCAAGCAACA CTGGGTCGCA • 200 

AAGCAGGTCC GGTAAATGCT 250 

CCAAGTTATG ATGAACAGTC 300 

CAAAGTGATT GATTTGCTTT 350 

TGTTCGGTGG TGCTGGTGTT 400 

AATAATATCG CCCTAAAACA 4 50 

TGAGCGTACT CGTGAGGGTA 500 

GCGTTGTTAA TACCGAAGAT 550 

GGTCAGATGA ATGAGCCACC 600 

TTTGACCATG GCAGAGTATT 650 

AAGGCCGTGA TGTTCTGCTG 700 

GCAGGTACTG AGGTATCAGC 750 

TTATCAGCCG ACTTTGGCCG 800 

CCTCCACCCA ATCAGGCTCA 850 

854 



2) INFORMATION FOR SEQ ID NO: 328 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Moraxella osloensis 

(B) STRAIN: ATCC 19976 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328 

CCGTCAAAGC GTACCAAGAA TTTATGATGC CTTAAAAGTT GAAGGCACAG 50 

AAACTACATT AGAAGTACAA CAACAATTGG GTGATGGTAT CGTACGTACT 100 

ATTGCCATGG GTTCTACTGA AGGTCTAAAA CGTGGTCTAC CAGTTAGCAA 150 

CACTGGCGCA CCAATCTCTG TACCTGTGGG TAAAGGTACA CTAGGTCGTA 200 
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TCATGGACGT 
AGTAACACTT 
AAACTCTACT 
GCCCATTTGC 
GGTAAAACCG 
CTCAGGTTTA 
ATGACTTCTA 
ATGGTGTATG 
CCTGACAGGT 
ACGGTAAAGG 
ACGCTAGCGG 
AGTAGGGTAT 
GTATTACTTC 



TTTAGGACAC 
GGGCGATTCA 
GAACTTTTAG 
TAAAGGTGGT 
TTAACATGAT 
TCGGTATTTG 
CCACGAGATG 
GTCAGATGAA 
TTAACCATGG 
TCGTGACGTA 
GTACCGAAGT 
CAGCCAACGC 
AACCCAATCA 



CCAATCGATG 
CCGTGAAGCG 
AAACCGGTAT 
AAAGTCGGTC 
GGAACTTATC 
CTGGTGTAGG 
AAAGACTCAA 
TGAGCCACCA 
CAGAATACTT 
TTATTGTTCG 
ATCAGCATTA 
TTGCAGAAGA 
GGCTCTATTA 



AGGCAGGTCC GGTAGXgCA^ 
CCAAGCTATG ATGAACAATC 
TAAAGTAATT GACTTACTAT 
TGTTCGGTGG CGCGGGTGTT 
AATAACATCG CAAAAGCACA 
TGAGCGTACT CGTGAAGGTA 
ACGTACTTGA TAAAGTTGCG 
GGAAACCGTT TACGTGTTGC 
CCGTGACGAA AAAGATGAAA 
TTGACAATAT TTATCGTTAC 
TTAGGTCGTA TGCCATCTGC 
GATGGGTGTA CTACAAGAAC 
C 



25© * 

300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
831 



2) INFORMATION FOR SEQ ID NO: 329 

(i) SEQUENCE CHARACTERISTICS : 

(A) ' LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Morganella morganii subsp. morganii 

(B) STRAIN: ATCC 25830 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329 

CGAATTTCCT CAGGATGCAG TACCGAAAGT GTACGATGCG CTTGAGGTAA 50 

CAAATGGTAA AGAAAAACTG GTGCTGGAAG TTCAGCAGCA GTTAGGCGGC 100 

GGGGTTGTCC GTTGTATCGC TATGGGTACA TCTGATGGTC TGAGCCGTAA 150 

TCTGGAAGTA ACCGATTTAG GCCACCCGAT CGAAGTCCCT GTCGGCGTGA 200 

AAACCTTAGG ACGTATCATG AACGTTCTGG GTGATCCGAT CGATATGAAA 250 

GGTGACATCG GCGCAGAAGA AAAATGGTCT ATTCACCGTG CTGCACCAAC 3 00 

ATACGAAGAA CTGTCTAACT CCCAGGAACT GCTGGAAACA GGTATCAAAG 350 

TAATGGACCT GATCTGCCCG TTCGCGAAGG GTGGTAAAGT CGGTCTGTTC 4 00 

GGTGGTGCGG GTGTGGGTAA AACCGTAAAC ATGATGGAAC TGATCCGTAA 4 50 

CATCGCGATC GAGCACTCCG GTTACTCTGT ATTCGCAGGG GTCGGTGAGC 500 

GTACCCGTGA AGGTAACGAC TTCTATCATG AAATGACAGA CTCCAACGTT 550 

CTGGACAAAG TATCACTCGT GTACGGCCAG ATGAACGAGC CACCGGGAAA 600 

CCGTCTGCGC GTTGCTCTGA CCGGTCTGAC CATGGCGGAA AAATTCCGTG 650 

ATGAAGGCCG CGATGTACTG CTGTTCGTTG ATAACATCTA CCGTTATACC 700 

CTGGCCGGTA CTGAAGTATC CGCGCTGTTA GGCCGTATGC CTTCAGCGGT 750 

AGGTTACCAG CCGACACTGG CGGAAGAAAT GGGTGTGCTT CAGGAACGTA 800 

TCACATCGAC CAAAACAGGC TCTATCACGT CTGTA 835 



2) INFORMATION FOR SEQ ID NO : 33 0 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pantoea agglomerans 
5 (B) STRAIN: ATCC 27155 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330 

GACGCGGTAC CGCAAGTGTA CAGCGCCCTC GAGGTTATGA ATGGTGATGC 50 

10 GCGTCTGGTG CTGGAAGTTC AGCAGCAGCT CGGCGGCGGC GTAGTACGTA 100 

CCATCGCAAT GGGTACGTCT GACGGCCTGA AGCGTGGTCT GAGCGTCAAC 150 

GACCTGCAGA AACCGATTCA GGTACCCGTC GGTAAAGCGA CCCTGGGCCG 2 00 

TATCATGAAC GTTCTCGGCG AGCCAATCGA TATGAAAGGC GAGCTGAAAG 2 50 

AAGAAGATGG CAGCGCAGTA GAGATCGCCT CTATTCACCG CGCAGCCCCT 3 00 

15 TCTTATGAAG ATCAGTCTAA CTCGCAGGAA CTGCTGGAAA CCGGCATCAA 350 

GGTTATCGAC CTGATGTGTC CGTTTGCTAA AGGCGGTAAA GTCGGTCTGT 4 00 

TCGGTGGTGC GGGTGTAGGT AAAACCGTCA ACATGATGGA ACTGATCCGT 4 50 

AACATCGCGG CTGAACACTC AGGTTACTCA GTGTTTGCCG GTGTGGGTGA 500 

GCGTACTCGT GAGGGTAACG ACTTCTACCA CGAAATGACT GACTCTAACG 550 

2 0 TTATCGATAA AGTTGCACTG GTCTATGGCC AGATGAACGA GCCGCCGGGT 600 

AACCGTCTGC GCGTAGCACT GACCGGTCTG ACCATGGCGG AAAAATTCCG 650 

TGATGAAGGT CGCGACGTTC TGCTGTTCAT CGATAACATC TACCGTTATA 700 

CCCTGGCCGG TACAGAAGTT TCTGCACTGC TGGGTCGTAT GCCATCTGCG 750 

GTAGGTTACC AGCCAACGCT GGCAGAAGAG ATGGGTGTGT TGCAGGAGCG 800 

2 5 TATTACCTCC ACCAAGACCG GTTC 824 



2) INFORMATION FOR SEQ ID NO: 331 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 808 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
3 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Pantoea dispersa 

(B) STRAIN: ATCC 14589 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331 

45 TATACAGCGC TCTCGAGGTA AAAAATGGTG ATGCTCGTCT GGTGCTGGAA 50 

GTACAGCAGC AGCTGGGCGG TGGCGTGGTG CGTACCATCG CCATGGGTTC 100 

TTCTGACGGC CTGAAGCGCG GTCTGGAAGT CACCGACCTG AAAAAACCTA 150 

TCCAGGTTCC GGTTGGTAAA GCAACACTCG GCCGTATCAT GAACGTGCTG 2 00 

GGTGAGCCAA TCGACATGAA AGGCGACCTG AAAGAAGAAG ACGGCAGCGC 2 50 

50 TGTAGAGGTT TCCTCTATTC ATCGCGCAGC GCCTTCTTAT GAAGATCAGT 3 00 

CAAACTCGCA GGAACTGCTG GAAACCGGCA TCAAGGTTAT CGACCTGATG 3 50 

TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG GTGCGGGTGT 4 00 

AGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC GCGGCTGAGC 4 50 

ACTCAGGTTA TTCGGTCTTT GCCGGCGTGG GTGAGCGTAC TCGTGAGGGT 500 

55 AACGACTTCT ACCACGAAAT GACGGACTCC AACGTTATCG ATAAAGTAGC 550 

GCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGTAACCGT CTGCGCGTAG 600 

CACTGACCGG TCTGACCATG GCGGAAAAAT TCCGTGATGA AGGCCGTGAC 650 

GTTCTGCTGT TCATCGACAA CATCTACCGT TACACCCTGG CCGGTACAGA 700 

GGTTTCTGCA CTGCTGGGTC GTATGCCATC GGCGGTAGGT TATCAGCCAA 750 

50 CGCTGGCTGA AGAGATGGGT GTGCTGCAGG AGCGTATTAC CTCCACCAAG 800 
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SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
ACCGGTTC 



PCT/CAOO/01150 



5 2) INFORMATION FOR SEQ ID NO: 332 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

L5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pasteurella multocida 

(B) STRAIN: NCTC 10322 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332 



GATGCAGTAC CAAAAGTATA TGATGCCTTA AATGTTGAAA CAGGTTTAGT 50 

ACTTGAAGTT CAACAACAAT TAGGTGGTGG TGTAGTTCGC TGTATCGCAA 100 

TGGGATCATC TGATGGATTA AAACGCGGTT TAAGCGTAAC AAATACGAAT 150 

AACCCAATTT CTGTTCCAGT GGGAACGAAA ACATTGGGTC GTATCATGAA 200 

25 CGTATTGGGT GAACCAATCG ATGAGCAAGG TGAAATCGGT GCAGAAGAGA 250 

ATTGGTCTAT TCACCGTGCG CCACCAAGTT ATGAAGAACA ATCTAACAGT 300 

ACTGAACTTT TAGAAACGGG AATTAAAGTT ATCGACTTAG TTTGTCCGTT 350 

TGCGAAAGGG GGTAAAGTAG GTTTATTCGG TGGTGCGGGT GTCGGTAAAA 400 

CCGTCAATAT GATGGAATTA ATCCGTAACA TCGCAATTGA GCACTCAGGT 450 

30 TACTCTGTCT TTGCGGGGGT AGGTGAGCGT ACGCGTGAAG GTAACGACTT 500 

CTATCATGAG ATGAAAGACT CTAACGTATT AGATAAAGTG TCTCTTGTTT 550 

ATGGTCAAAT GAACGAGCCA CCAGGTAACC GTTTACGTGT GGCATTAACA 600 

GGCTTAACTA TGGCGGAAAA ATTCCGTGAT GAAGGTCGTG ATGTCTTATT 650 

CTTCGTTGAT AATATTTATC GTTATACTCT TGCTGGTACA GAAGTTTCTG 7 00 

35 CATTATTAGG TCGTATGCCA TCTGCGGTAG GTTATCAACC AACCCTTGCA 750 

GAAGAAATGG GTGTTCTGCA AGAGCGTATT ACCTCAACCA AAACAGGTTC 800 

TATTA 805 



10 



50 



2) INFORMATION FOR SEQ ID NO: 333 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 82 8 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pragia fontium 

(B) STRAIN: ATCC 49100 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333 

TTTCCTCAAG ACGCCGTACC AAAAGTGTAC GACGCGCTTG AAGTTCAGAA 50 

CGATGCCAAG CTGGTGCTGG AAGTTCAACA ACAGCTCGGT GGTGGTGTCG 100 

TTCGTTGTAT CGCAATGGGT ACTTCCGATG GCTTAAGCCG CGGTTTAAAA 150 

50 GTGCTTGATT TAGAACATCC TATCGAAGTA CCGGTTGGTA CTGCGACGCT 2 00 
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GGGCCGTATT ATGAACGTGC TCGGTCAGCC AATCGATATG AAAGGCGATA 250 

TTGGTGAAGA AGAGCGTTGG GCTATTCACC GTGAAGCACC AAGTTATGAA 3 00 

GATTTATCTG GCGCCAATGA ACTGCTGGAA ACGGGTATCA AGGTTATCGA 350 

CCTGATTTGT CCGTTTGCTA AAGGTGGTAA AGTTGGTCTG TTTGGTGGTG 400 

5 CGGGTGTAGG TAAAACCGTA AACATGATGG AGCTGATTCG TAACATTGCG 450 

ACTGAGCACT CAGGTTACTC CGTATTCGCC GGTGTAGGGG AACGTACCCG 500 

TGAAGGTAAT GACTTCTACC ACGAAATGAC CGAATCAAAC GTACTGGATA 550 

AAGTATCTCT GGTTTATGGC CAGATGAACG AGCCACCAGG AAACCGTCTG 600 

CGCGTGGCGT TAACGGGTTT GACCATGGCT GAAAAATTCC GTGATGAAGG 650 

LO TCGTGACGTT CTGTTATTTA TCGATAACAT TTATCGCTAT ACCTTAGCCG 700 

GTACCGAAGT ATCAGCACTG TTGGGGCGTA TGCCATCAGC GGTAGGTTAT 750 

CAGCCAACGT TAGCAGAAGA GATGGGTGTG TTGCAGGAAC GTATTACTTC 800 

AACCAAAACC GGTTCAATCA CTTCTGTA 828 



L5 



30 



2) INFORMATION FOR SEQ ID NO: 334 



(i) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH; 807 bases 

(B) TYPE: Nucleic acid 
<C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear 

25 1 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mi rati lis 

(B) STRAIN: ATCC 25933 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334 



GTCCCTAAAG TATACGACGC TCTTGAGGTT ATGAATGGTA AAGAAAAACT 50 

GGTGCTGGAA GTTCAGCAAC AGTTAGGCGG TGGTATCGTT CGTTGTATCG 100 

35 CAATGGGTAC ATCAGACGGT TTAAGCCGTG GCTTAAAGGT TGAAGATTTA 150 

GGCCACCCAA TTGAAGTACC AGTAGGTAAA GCGACTTTAG GACGTATCAT 200 

GAACGTTCTG GGTACACCTA TTGATATGAA AGGTGAGATT GAAACCGAAG 250 

AGCGTTGGTC AATCCACCGT GAAGCACCAA CTTACGAAGA GTTATCAAAC 300 

TCTCAAGAAC TGCTTGAAAC CGGTATCAAA GTTATGGACT TAATCTGTCC 350 

10 ATTTGCTAAA GGTGGTAAAG TCGGTCTGTT CGGTGGTGCG GGTGTTGGTA 400 

AAACAGTTAA CATGATGGAA TTGATCCGTA ATATCGCGAT CGAGCACTCA 450 

GGTTACTCTG TATTTGCTGG TGTTGGTGAG CGTACTCGTG AGGGTAACGA 500 

CTTCTATCAT GAAATGACAG ATTCTAACGT TCTTGACAAA GTATCGTTAG 550 

TTTACGGTCA GATGAATGAG CCACCAGGAA ACCGTCTGCG TGTTGCACTG 600 

15 ACTGGTCTGA CTATGGCTGA GAAATTCCGT GATGAAGGCC GTGACGTACT 650 

GTTATTCGTC GATAACATCT ATCGTTACAC CTTAGCCGGT ACAGAAGTAT 700 

CAGCACTGTT AGGTCGTATG CCATCAGCGG TAGGTTACCA ACCAACATTG 750 

GCTGAAGAGA TGGGTGTTCT GCAAGAGCGT ATCACTTCAA CCAAAACAGG 800 

TTCTATC 807 

50 



2) INFORMATION FOR SEQ ID NO: 335 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 811 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

200 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
(ii) MOLECULE 



TYPE: 



Genomic DNA 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus vulgaris 
5 (B) STRAIN: ATCC 13315 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335 

CCCTAAAGTA TACGACGCTC TTGAGGTTAT GAATGGTAAA GAGAAACTGG 50 

L0 TGCTAGAAGT TCAGCAACAG TTAGGCGGTG GTATCGTTCG TTGTATCGCA 100 

ATGGGTACAT CAGACGGTTT AAGCCGTGGC TTAAAAGTTG AAAACTTAGG 150 

CCACCCAATT GAAGTACCAG TAGGTAAAGC AACACTGGGA CGTATCATGA 2 00 

ACGTTCTGGG TACACCTATC GATATGAAAG GTGATATTGC AACTGAAGAA 250 

CGTTGGTCTA TTCACCGCGA AGCGCCAACC TATGAAGAGT TATCAAGCTC 3 00 

15 TCAAGAACTA CTAGAAACCG GTATCAAAGT AATGGACTTA ATCTGTCCGT 350 

TTGCTAAAGG TGGTAAAGTA GGTCTCTTCG GTGGTGCGGG TGTTGGTAAA 400 

ACAGTTAACA TGATGGAATT GATCCGTAAC ATCGCGATTG AGCACTCAGG 450 

TTATTCTGTA TTTGCAGGTG TTGGTGAGCG TACTCGTGAG GGTAACGACT 500 

TCTATCATGA AATGACAGAT TCTAACGTTC TTGACAAAGT ATCGTTAGTT 550 

20 TATGGTCAGA TGAATGAGCC ACCAGGAAAC CGTCTACGTG TAGCACTGAC 600 

GGGTTTAACC ATGGCGGAAA AATTCCGTGA TGAAGGCCGT GACGTACTGT 650 

TATTCGTCGA TAACATCTAT CGTTACACCT TAGCCGGTAC CGAAGTATCA 70 0 

GCACTGTTAG GCCGTATGCC ATCAGCAGTA GGTTACCAAC CAACATTGGC 750 

TGAAGAGATG GGTGTTCTGC AAGAACGTAT CACTTCAACC AAAACAGGTT 80 0 

25 CAATCACCTC T 811 



2) INFORMATION FOR SEQ ID NO: 33 6 



30 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Providencia alcalifaciens 

(B) STRAIN: ATCC 9886 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336 

15 TCAAGATAAC GTACCAAAAG TGTACGATGC TCTTGAGGTT ATTAACGGTA 50 

AAGAAAAACT GGTGTTGGAA GTTCAACAAC AGTTAGGTGG TGGTGTTGTC 100 

CGTTGTATCG CAATGGGTAC ATCAGATGGT CTGAGCCGTG GTTTAGAAGT 150 

TGTAAACTTA GAGCACCCAA TCGAAGTACC AGTCGGTAAA GCAACTCTGG 200 

GACGTATCAT GAACGTTCTG GGTGAACCAA TCGACATGAA AGGTGATATC 250 

50 GGCGAAGAAG AGCGCTGGTC TATTCACCGT GCTGCACCAA GCTACGAAGA 300 

ATTAGCTAAC TCAACTGAAC TGCTGGAAAC CGGTATCAAA GTAATGGACT 350 

TAATCTGTCC ATTCGCGAAA GGTGGTAAAG TAGGTCTGTT CGGTGGTGCG 400 

GGTGTTGGTA AAACCGTAAA CATGATGGAA CTGATCCGTA ACATCGCGAT 450 

TGAGCACTCA GGTTACTCAG TGTTCGCTGG TGTTGGTGAG CGTACCCGTG 500 

55 AAGGTAACGA CTTCTATCAT GAAATGACAG ACTCAAACGT TCTGGATAAA 550 

GTATCACTGG TTTATGGCCA GATGAACGAG CCACCAGGAA ACCGTCTGCG 600 

TGTTGCGCTG ACTGGTCTGA CTATGGCTGA AAAATTCCGT GACGAAGGTC 650 

GTGACGTACT GCTGTTCGTT GACAACATTT ATCGTTATAC ACTGGCAGGT 700 

ACTGAAGTAT CAGCACTGTT AGGTCGTATG CCATCAGCGG TAGGTTACCA 750 

50 ACCAACGCTG GCGGAAGAGA TGGGTGTTCT TCAAGAACGT ATTACCTCAA 800 

201 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
CTCAAA 



PCT/CA00/01150 



5 2) INFORMATION FOR SEQ ID NO: 337 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 
LO (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

L5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia rettgeri 

(B) STRAIN: ATCC 9250 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337 



TTCCCTCAAG ATGACGTACC AAAAGTGTAC GACGCTCTTG AGGTTGTTAA 50 

CGGTAAAGAA ACACTGGTGC TGGAAGTTCA GCAACAGTTA GGCGGTGGTG 100 

TTGTCCGTTG TATCGCAATG GGTACATCAG ATGGCCTGAG CCGTGGTTTA 150 

GAAGTTGTAA ACTTAGAGCA CCCAATTGAA GTACCAGTAG GTAAAGCAAC 200 

25 TTTAGGACGT ATCATGAACG TTCTGGGTCA GCCTATTGAT ATGAAAGGTG 250 

ATATCGGCGA AGAAGAGCGC TGGTCAATTC ACCGTGCTGC ACCTAGCTAC 300 

GAAGAGTTAG CTAACTCAAC AGAGCTGCTG GAAACCGGTA TCAAAGTAAT 350 

GGACTTAATC TGTCCATTCG CGAAAGGTGG TAAAGTTGGT CTGTTCGGTG 400 

GTGCGGGTGT TGGTAAAACA GTAAACATGA TGGAACTGAT CCGTAACATC 450 

30 GCGATTGAGC ACTCAGGTTA CTCAGTATTC GCTGGTGTTG GTGAGCGTAC 500 

TCGTGAAGGG AACGACTTCT ATCATGAAAT GACTGACTCA AACGTTCTGG 550 

ATAAAGTATC ACTGGTTTAT GGCCAGATGA ATGAGCCACC AGGAAACCGT 600 

CTGCGCGTTG CGTTGACTGG TCTGACTATG GCTGAAAAAT TCCGTGACGA 650 

AGGTCGTGAC GTACTACTGT TCGTTGACAA CATCTATCGT TATACACTGG 700 

35 CAGGTACTGA AGTATCAGCA CTGTTAGGTC GTATGCCTTC AGCGGTAGGT 750 

TATCAGCCAA CGCTGGCGGA AGAGATGGGT GTTCTGCAAG AACGTATTAC 800 

CTCAACTCAA ACGGGTTCTA TCACTTCCGT 830 



10 



50 



2) INFORMATION FOR SEQ ID NO: 338 



( i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 812 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia rustigianii 

(B) STRAIN: ATCC 33673 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338 

AGCGTACCAA AAGTGTACGA TGCTCTTGAG GTTATTAACG GTAAAGAAAA 50 

ACTGGTGTTG GAAGTTCAGC AGCAGTTAGG CGGTGGTGTT GTCCGTTGTA 100 

TCGCAATGGG TACATCAGAT GGTCTGAGCC GTGGTTTAGA AGTTGTAAAC 150 

50 TTAGAACACC CAATTGAAGT ACCAGTAGGT AAAGCAACTC TGGGACGTAT 200 

202 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



CATGAACGTT 
AAGAGCGCTG 
AACTCAACAG 
TCCATTCGCG 
GTAAAACAGT 
TCAGGTTACT 
CGACTTCTAT 
TGGTTTATGG 
CTGACTGGTC 
ACTGCTGTTC 
TATCAGCACT 
TTGGCAGAAG 
CGGTTCTATC 



CTGGGTGACC 
GTCTATTCAC 
AACTGCTAGA 
AAAGGTGGTA 
AAACATGATG 
CAGTATTCGC 
CATGAAATGA 
CCAGATGAAC 
TGACTATGGC 
GTTGACAACA 
GTTAGGTCGT 
AGATGGGTGT 
AC 



CTATTGATAT 
CGTTCAGCGC 
AACCGGTATC 
AAGTTGGTCT 
GAACTGATCC 
TGGTGTTGGT 
CTGATTCTAA 
GAGCCACCAG 
TGAAAAATTC 
TTTATCGTTA 
ATGCCTTCAG 
TCTACAAGAA 



GAAAGGTGAT ATCGGCGAAG 
CAAGCTATGA AGAATTAGCT 
AAAGTAATGG ACTTAATCTG 
GTTCGGTGGT GCGGGTGTTG 
GTAACATCGC GATTGAGCAC 
GAGCGTACCC GTGAAGGTAA 
CGTTCTGGAT AAAGTATCAC 
GAAACCGTCT GCGTGTTGCG 
CGTGACGAAG GTCGTGACGT 
TACACTGGCA GGTACTGAAG 
CGGTAGGTTA TCAGCCAACA 
CGTATCACTT CTACCAAAAC 



2"5<J 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
812 



LO 



15 



20 



25 



30 



35 



10 



15 



50 



55 



2) INFORMATION FOR SEQ ID NO: 339 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia stuartii 

(B) STRAIN: ATCC 33672 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339 

TCAAGATGCA GTACCAAAAG TGTACGATGC GCTTGAGGTT GTTAACGGTA 50 

AAGAAAAACT GGTGCTGGAA GTTCAGCAAC AGTTAGGCGG TGGTGTTGTC 100 

CGTTGTATCG CAATGGGTAC ATCAGATGGC CTAAGCCGTG GTTTAGAAGT 150 

TAAAAATTTA GAACACCCAA TTGAAGTACC AGTAGGTAAA GCAACACTCG 200 

GACGTATCAT GAACGTTCTG GGTGACCCTA TTGATATGAA AGGTGATATC 250 

GGCGAAGAAG AGCGTTGGTC TATTCACCGC GCTGCACCAA GCTACGAAGA 300 

GCTATCGAGC TCAACTGAAC TGCTAGAGAC AGGTATCAAA GTCATGGACT 350 

TGATCTGTCC ATTCGCGAAA GGTGGTAAAG TTGGTCTGTT CGGTGGTGCG 400 

GGTGTTGGTA AAACGGTAAA CATGATGGAA CTTATCCGTA ACATCGCGAT 450 

TGAGCACTCA GGTTACTCAG TATTCGCAGG TGTTGGTGAG CGTACCCGTG 500 

AAGGTAACGA CTTCTATCAT GAAATGACAG ATTCAAACGT TCTTGACAAA 550 

GTATCACTGG TTTATGGTCA GATGAATGAG CCACCAGGAA ACCGTCTACG 600 

CGTAGCATTG ACTGGTTTGA CTATGGCTGA GAAATTCCGT GACGAAGGCC 650 

GTGATGTTCT GTTGTTCGTG GATAACATCT ATCGTTATAC ACTGGCAGGT 700 

ACAGAAGTAT CGGCTCTGTT AGGTCGTATG CCATCAGCAG TAGGTTATCA 750 

GCCAACATTG GCAGAAGAGA TGGGTGTTCT TCAAGAACGT ATCACTTCTA 800 

CTAAGACAGG TTCTATCAC 819 



2) INFORMATION FOR SEQ ID NO: 340 

( i ) S EQUENCE CHARACTER I ST I CS : 

(A) LENGTH: 907 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



203 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Psychrobacter phenylpyruvicus 
5 (B) STRAIN: ATCC 23333 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340 

AACCGCACTG ACGTGCCTCA AATTTATGAC GCGTTAGTTG TAGATGGTAC 50 

10 CGAAACCACC CTAGAAGTTC AGCAGCAGCT GGGCGATGGT GTGGTACGTA 100 

CTATTGCAAT GGGATCTACT GAAGGTCTTA AGCGTGGTTT ACCAGTAACA 150 

AACACTGGTG CCCCAATTAC AGTTCCAGTA GGTGATGCGA CTTTAGGTCG 200 

TATTATGGAC GTTTTAGGTC GTCCAATTGA CGAACAAGGT CCAGTTAATT 250 

CTGAAGACCA TTGGTCAATC CACCGTCAAG CGCCATCATA TGATGAGCAA 300 

15 GCTAACAGTA CTGACCTATT AGAGACAGGT ATTAAAGTAA TTGACTTACT 350 

TTGTCCGTTC GCTAAAGGGG GTAAAGTTGG TCTGTTCGGT GGTGCCGGTG 4 00 

TTGGTAAAAC CGTAAACATG ATGGAATTGA TTAATAACAT CGCTCTTAAG 450 

CACTCAGGTT TATCAGTATT CGCTGGTGTG GGTGAGCGTA CTCGTGAAGG 500 

TAACGACTTC TACCACGAGA TGCAAGAAGC GGGTGTTGTT GACGTTGAAA 550 

2 0 ACTTCACCAA CTCAAAAGTT GCGATGGTTT ATGGTCAGAT GAATGAGCCA 600 

CCAGGTAACC GTTTACGTGT TGCGTTAACC GGTCTGACTA TGGCTGAGTA 650 

CTTCCGTGAT CAAAAAGATG AAAACGGTAA AGGTAAAGAC GTTCTATTAT 700 

TCGTTGATAA CATCTACCGC TACACGCTAG CCGGTACTGA AGTATCAGCA 750 

CTTCTAGGTC GTATGCCATC AGCAGTAGGT TATCAGCCAA CACTAGCGGA 800 

25 AGAGATGGGT GTACTACAAG AGCGTATTAC TTCAACTCAG ACTGGTTCTA 850 

TTACTTC 907 



30 2) INFORMATION FOR SEQ ID NO: 341 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rahnella aquatilis 

(B) STRAIN: DSM 4 594 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341 



GTTCCCTCAG GATGCAGTAC CGAACGTGTA CAATGCTCTT GAGGTAGAAA 50 

ACGGTACCTC CAAACTGGTG CTGGAAGTTC AGCAACAGTT AGGCGGCGGC 100 

GTTGTTCGTT GTATCGCAAT GGGTACCTCA GACGGCCTGC GTCGCGGTCT 150 

GAAAGTGAAC AACCTGGAAC ACCCAATTGA AGTACCGGTA GGTAAAGCGA 200 

50 CTCTGGGTCG TATCATGAAC GTATTGGGTG AACCAATCGA CATGAAAGGT 250 

GAAATCGGCG AAGAAGAACG TCGTGCTATT CACCGTGCTG CGCCTTCTTA 3 00 

TGAAGAGCTG GCAAACTCCC AGGAATTGCT GGAAACCGGT ATCAAAGTTA 350 

TGGACCTGAT GTGTCCGTTC GCTAAGGGCG GTAAAGTTGG TCTGTTCGGT. 400 

GGTGCGGGTG TAGGTAAAAC TGTGAACATG ATGGAGCTGA TCCGTAACAT 450 

55 TGCGATCGAG CACTCCGGTT ATTCTGTGTT TGCAGGCGTG GGTGAACGTA 500 

CTCGTGAGGG TAACGACTTC TACCACGAAA TGACTGATTC CAACGTTATC 550 

GACAAAGTTT CCCTGGTGTA TGGCCAGATG AATGAGCCAC CAGGTAACCG 600 

TCTGCGCGTT GCACTGACCG GCCTGACCAT GGCGGAAAAA TTCCGTGATG 650 

AAGGTCGTGA CGTACTGCTG TTCGTTGACA ACATTTACCG TTACACCCTG 700 

60 GCCGGTACCG AAGTGTCCGC ACTTCTGGGC CGTATGCCAT CGGCGGTAGG 750 

204 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



TTATCAGCCA ACGCTGGCGG AAGAGATGGG 
CCTCGACCAA AAGTGGTTCT ATCACCTCCG 



CGCTCTGCAA GAACGTATCA* 
TA 



800 
832 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 342 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

( i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. arizonae 

(B) STRAIN: ATCC 13314 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342 

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCTCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTT 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCCAC 200 

GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAT ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCTATTC ACCGCGCGGC GCCGTCCTAC 3 00 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TAAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCT AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCACC GGGAAACCGT 600 

CTGCGCGTTG CATTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGATGA 650 

AGGTCGTGAC GTTCTGCTGT TCGTCGACAA CATCTACCGT TATACCCTCG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

TAYCASCCGA CTCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACCAAA ACCGGTTCTA TCAC 824 



2) INFORMATION FOR SEQ ID NO: 343 

( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 820 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp, choleraesuis 
serotype Choleraesuis 

(B) STRAIN: ATCC 7001 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343 

GATGCCGTAC CACGCGTGTA CGATGCCCTT GAGGTGCAGA ATGGTAATGA 50 
GAAGCTGGTG CTGGAAGTTC AGCAGCAGCT TGGCGGCGGT ATCGTGCGTA 100 



205 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

CCATCGCGAT GGGGTCTTCT GACGGTCTGC GTCGCGGTCT * GGATGTAAA^ 150 

GATCTCGAAC ACCCGATCGA AGTCCCGGTA GGTAAAGCCA CGCTGGGTCG 200 

TATCATGAAC GTCCTGGGCG AACCGGTCGA CATGAAAGGC GAGATCGGCG 250 

AAGAAGAGCG TTGGGCGATT CACCGCGCAG CACCTTCCTA CGAAGAGTTG 3 00 

5 TCAAACTCTC AGGAACTGCT GGAAACCGGT ATCAAAGTTA TCGACCTGAT 3 50 

GTGTCCGTTC GCGAAGGGCG GTAAAGTCGG TCTGTTCGGT GGTGCGGGTG 400 

TAGGTAAAAC CGTAAACATG ATGGAGCTTA TTCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ACTCAGTGTT TGCGGGCGTA GGGGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGACCGACTC CAACGTTATC GATAAAGTAT 550 

10 CCCTGGTGTA TGGCCAGATG AACGAGCCGC CGGGAAACCG TCTGCGCGTT 600 

GCACTGACCG GCCTGACCAT GGCGGAGAAA TTCCGTGACG AAGGTCGTGA 650 

TGTACTGCTG TTCGTCGATA ACATCTATCG TTACACCCTG GCCGGTACGG 700 

AAGTATCCGC ACTGCTGGGC CGTATGCCTT CCGCAGTAGG TTACCAGCCG 750 

ACTCTGGCGG AAGAGATGGG CGTTCTGCAG GAACGTATCA CCTCCACCAA 800 

15 AACCGGTTCT ATCACCTCCG 820 



2) INFORMATION FOR SEQ ID NO: 344 

20 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Salmonella, choleraesuis subsp. diarizonae 

(B) STRAIN: ATCC 43973 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344 

35 TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCTCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 
TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTT ' 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC 200 

GCTGGGTCGT ATCATGAACG TCCTGGGTGA ACCGGTCGAT ATGAAAGGCG 250 

40 AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCGGC GCCGTCCTAC 300 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 4 50 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

45 TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCT AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCACC GGGAAACCGT 600 

CTGCGCGTTG CATTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGATGA 650 

AGGTCGTGAC GTTCTGCTGT TCGTCGACAA CATCTACCGT TATACCCTCG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

50 TATCAGCCGA CTCTGGCTGA AGAAATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACCAAA ACCGGTTCTA TCACCTCCGT A 831 



55 2) INFORMATION FOR SEQ ID NO: 345 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 

206 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CAOO/01150 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. houtenae 

(B) STRAIN: ATCC 43 974 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345 



10 



TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCTCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCGTACGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCTAC 200 

15 GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAT ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCTGC GCCGTCCTAC 3 00 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 3 50 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

20 GCGATCGAGC ACTCCGGTTA CTCCGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGTCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CATTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGACGA 650 

AGGTCGTGAC GTTCTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG 700 

25 CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

TATCAGCCGA CGCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 8 00 

CTCCACCAAG ACCGGTTCTA TCACCTCCGT A 831 



3 0 

2) INFORMATION FOR SEQ ID NO: 346 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 82 9 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. indica 

(B) STRAIN: ATCC 43974 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346 

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCCCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTCGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTG 150 

50 GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCCAC 200 

GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAT ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCGGC GCCGTCCTAT 300 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 4 00 

55 GTGCGGGCGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 50 0 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ACAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAGAAGT TCCGTGACGA 650 

60 AGGTCGTGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TATACCCTGG 700 
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CCGGTACGGA AGTTTCCGCA CTGCTGGGTC 
TATCAGCCAA CTCTGGCGGA AGAGATGGGC 
CTCCACCAAA ACCGGTTCTA TCACCTCCG 



GTATGCCTTC CGCG&TAGGT 
GTTCTGCAGG AACGTATCAC 



75t) • 

800 

829 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 347 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Paratyphi A 

(B) STRAIN: ATCC 9150 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 7 

GCCGTACCAC GCGTGTACGA TGCCCTTGAG GTGCAGAATG GTAATGAGAA 50 

GCTGGTGCTG GAAGTTCAGC AGCAGCTTGG CGGCGGTATC GTGCGTACCA 100 

TCGCGATGGG GTCTTCTGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAT 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCTACGC TGGGTCGTAT 2 00 

CATGAACGTC CTGGGCGAAC CGGTCGACAT GAAAGGCGAG ATCGGCGAAG 250 

AAGAGCGTTG GGCGATTCAC CGCGCAGCGC CTTCCTACGA AGAGTTGTCA 3 00 

AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 3 50 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACCGT AAACATGATG GAGCTTATCC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CATGAAATGA CCGACTCCAA CGTTATCGAT AAAGTATCCC 550 

TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCA 600 

CTGACCGGCC TGACCATGGC GGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGTCGT ATGCCTTCCG CGGTAGGTTA CCAGCCGACT 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAGAC 800 

CGGTTCTATC ACCTCCG 817 



2) INFORMATION FOR SEQ ID NO: 348 . 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Paratyphi B 

(B) STRAIN: ATCC 8759 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348 
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GCCGTACCAC 
GCTGGTGCTG 
TCGCGATGGG 
CTCGAACACC 
CATGAACGTC 
AAGAGCGTTG 
AACTCTCAGG 
TCCGTTCGCG 
GTAAAACCGT 
TCCGGTTACT 
CGACTTCTAC 
TGGTGTATGG 
TTGACCGGTC 
ACTGCTGTTC 
TATCCGCACT 
CTGGCGGAAG 
CGGTTC 



GCGTGTACGA 
GAAGTTCAGC 
GTCTTCTGAC 
CGATCGAAGT 
CTGGGCGAAC 
GGCGATTCAC 
AACTGCTGGA 
AAGGGCGGTA 
AAACATGATG 
CTGTGTTTGC 
CACGAAATGA 
CCAGATGAAC 
TGACCATGGC 
GTCGATAACA 
GCTGGGCCGT 
AGATGGGCGT 



TGCCCTTGAG 
AGCAGCTTGG 
GGTCTGCGTC 
CCCGGTAGGT 
CGGTCGACAT 
CGCGCAGCGC 
AACCGGTATC 
AAGTCGGTCT 
GAGCTTATCC 
GGGCGTAGGT 
CCGACTCCAA 
GAGCCGCCGG 
GGAGAAATTC 
TCTATCGTTA 
ATGCCTTCCG 
TCTGCAGGAA 



GTGCAGAATG 
CGGCGGTATT 
GCGGTCTGGA 
AAAGCCACGC 
GAAAGGCGAG 
CTTCCTACGA 
AAAGTTATCG 
GTTCGGTGGT 
GTAACATCGC 
GAACGTACTC 
CGTTATCGAT 
GAAACCGTCT 
CGTGACGAAG 
CACCCTGGCC 
CAGTAGGTTA 
CGTATCACCT 



GTAATGAGAA 
GTGCGTACCA 
TGTAAAAGAT 
TGGGTCGTAT 
ATCGGCGAAG 
AGAGTTGTCA 
ACCTGATGTG 
GCGGGTGTGG 
GATCGAGCAC 
GTGAGGGTAA 
AAAGTATCCC 
GCGCGTTGCA 
GTCGTGACGT 
GGTACGGAAG 
CCAGCCGACT 
CCACCAAAAC 



*0 

100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
806 



2) INFORMATION FOR SEQ ID NO: 349 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

( i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. salamae 

(B) STRAIN: ATCC 43972 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349 

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCCCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGKGGTCTG 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC 2 00 

GCTGGGCCGT ATCATGAACG TACTGGGCGA ACCGGTCGAC ATGAAAGGCG 2 50 

AGATTGGTGA AGAAGAGCGT TGGGCTATTC ACCGTGCTGC GCCGTCCTAC 3 00 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 3 50 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTCAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCCGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGACGA 650 

AGGTCGTGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TATACCCTGG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

TATCAGCCGA CGCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACTAAA ACCGGTTCTA TCACCTCCGT A 831 



2) INFORMATION FOR SEQ ID NO: 350 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 823 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Typhi 
10 (B) STRAIN: ATCC 1074 9 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350 

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCCCTTG AGGTGCAGAA 50 

15 TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCTAC 2 00 

GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAC ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCAGC GCCTTCCTAC 3 00 

2 0 GAAGAGTTAT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCATGAAAT GACCGACTCC AACGTTATCG 550 

25 ATAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAGAAAT TCCGTGACGA 650 

AGGTCGTGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 7 50 

TACCAGCCGA CTCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

30 CTCCACCAAG ACCGGTTCTA TCA 823 



2) INFORMATION FOR SEQ ID NO: 351 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
4 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 

serotype Typhimurium 
(B) STRAIN: ATCC 14028 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351 



TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCCCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TTGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCTAC 2 00 

55 GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAC ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCAGC GCCTTCCTAC 300 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 3 50 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 4 00 

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

60 GCGATCGAGC ACTCCGGTTA CTCAGTGTTT GCGGGCGTAG GGGAACGTAC 500 
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TCGTGAGGGT AACGACTTCT ACCACGAAAT 
ATAAAGTATC CCTGGTGTAT GGCCAGATGA 
CTGCGCGTTG CATTGACCGG TCTGACCATG 
AGGTCGTGAC GTACTGCTGT TCGTCGATAA 
CCGGTACGGA AGTATCCGCA CTGCTGGGCC 
TACCAGCCGA CTCTGGCGGA AGAGATGGGC 
CTCCACCAAA ACCGGTTCTA TCACCTCCGT 



GACCGACTCC ffAOCTTATC© 
ACGAGCCGCC GGGAAACCGT 
GCGGAGAAAT TCCGTGACGA 
CATCTATCGT TACACCCTGG 
GTATGCCTTC CGCAGTAGGT 
GTTCTGCAGG AACGTATCAC 
A 



13 



600 
650 
700 
750 
800 
831 



2) INFORMATION FOR SEQ ID NO: 352 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Virchow 

(B) STRAIN: ATCC 51955 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352 

GCCGTACCAC GCGTGTACGA TGCCCTTGAG GTGCAGAATG GTAATGAGAA 50 

GCTGGTGCTG GAAGTTCAGC AGCAGCTTGG CGGCGGTATC GTGCGTACCA 100 

TCGCGATGGG GTCTTCTGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAT 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCCACGC TGGGTCGTAT 2 00 

CATGAACGTC CTGGGCGAAC CGGTCGACAT GAAAGGCGAG ATCGGCGAAG 250 

AAGAGCGTTG GGCGATTCAC CGCGCAGCAC CTTCCTACGA AGAGTTGTCA 3 00 

AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACCGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGG GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAT AAAGTCTCCC 550 

TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCA 600 

TTGACCGGTC TGACCATGGC GGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGCCGT ATGCCTTCCG CAGTAGGTTA CCAGCCGACT 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAAAC 800 

CGGTTCTATC 810 



2) INFORMATION FOR SEQ ID NO: 353 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Serratia ficaria 

(B) STRAIN: ATCC 33105 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353 

GATGCCGTAC CAAAAGTGTA CGATGCCCTT GAGGTAGAAA ACGGTACCGA 50 

5 GAAACTGGTG CTGGAAGTTC AGCAGCAGCT GGGCGGTGGC GTCGTTCGCT 100 

GTATCGCAAT GGGGACCTCT GACGGTCTGC GTCGCGGTCT GAAAGTGAAC 150 

AACCTGGAAC ACCCGATTGA AGTGCCGGTG GGTAAAGCTA CCCTGGGCCG 200 

TATCATGAAC GTATTGGGCG AACCAATCGA CATGAAAGGC GAGATCGGCG 250 

AAGAAGAGCG TTGGGCGATT CACCGTCCTG CGCCAAGCTA CGAAGAGCTG 300 

10 TCCAACTCCC AGGACCTGCT GGAAACCGGT ATCAAGGTAA TGGACCTGAT 350 

TTGTCCGTTC GCCAAGGGCG GTAAAGTCGG TCTGTTCGGT GGTGCGGGCG 400 

TGGGCAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ATTCCGTGTT TGCGGGCGTG GGCGAGCGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGAACGACTC CAACGTTCTG GACAAAGTAT 550 

15 CCCTGGTTTA CGGCCAGATG AACGAGCCGC CGGGTAACCG TCTGCGCGTT 600 

GCATTGACCG GCCTGACCAT GGCGGAGAAA TTCCGTGACG AAGGCCGCGA 650 

CGTTCTGCTG TTCGTTGACA ACATTTACCG TTACACCCTG GCCGGTACCG 700 

AAGTGTCCGC ACTTCTGGGC CGTATGCCAT CCGCGGTAGG TTATCAGCCA 750 

ACGCTGGCGG AAGAGATGGG CGTTCTGCAA GAACGTATCA CCTCGACCAA 800 

20 GACCGGTTCC ATCACCTCCG 820 



2) INFORMATION FOR SEQ ID NO: 354 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Serratia fonticola 

(B) STRAIN: ATCC 2 9844 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354 

40 GATGCCGTAC CGAAAGTGTA CAACGCTCTT GAGGTTGCAA ACGGCACCGA 50 

GAAATTGGTG CTGGAAGTTC AGCAACAGCT GGGTGGCGGC GTGGTTCGCT 10 0 

GTATCGCAAT GGGGACCTCA GACGGTCTGC GTCGTGGTCT GGCCGTAACC 150 

GACCTGCAGC ACCCAATTGA AGTACCGGTA GGTAAAGCTA CCCTGGGCCG 2 00 

TATCATGAAC GTATTGGGTG AACCAATCGA CATGAAGGGC GACATCGGCG 250 

45 AAGAAGAACG TTGGGCTATT CACCGCCCTG CGCCAAGCTA CGAAGAGCTG 300 

TCCAGCTCCC AGGATCTGCT GGAAACCGGT ATCAAGGTAA TGGACCTGAT 350 

CTGCCCGTTC GCCAAGGGTG GTAAAGTTGG TCTGTTCGGT GGTGCTGGTG 4 00 

TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ATTCTGTGTT TGCGGGCGTG GGTGAACGTA CTCGTGAGGG 500 

50 TAACGACTTC TACCACGAAA TGACCGATTC CAACGTACTG GACAAAGTTT 550 

CCCTGGTTTA CGGCCAGATG AACGAGCCAC CAGGTAACCG TCTGCGCGTT 600 

GCGCTGACCG GCCTGACCAT GGCTGAGAAG TTCCGTGACG AAGGTCGTGA 650 

CGTACTGCTG TTCGTCGATA ACATCTACCG TTATACCCTG GCCGGTACCG 700 

AAGTGTCCGC ACTTCTGGGC CGTATGCCAT CGGCGGTAGG TTATCAGCCA 750 

55 ACGCTGGCGG AAGAGATGGG TGTTCTGCAA GAACGTATCA CCTCTACCAA 800 

GACTGGTTCA ATCACC ' 816 



60 2) INFORMATION FOR SEQ ID NO: 355 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia grimes ii 

(B) STRAIN: ATCC 14460 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355 

GATGCCGTAC CAAAAGTGTA CAACGCTCTT GAGGTAGAAA ACGGTGCCAA 50 

TAAGCTGGTG CTGGAAGTTC AGCAACAGTT GGGCGGTGGC GTGGTTCGCT 100 

GTATCGCAAT GGGGACCTCT GATGGTCTGC GTCGCGGTCT GAAAGTCACA 150 

GACCTGGACC ACCCAATTGA AGTACCGGTA GGTAAAGCTA CTCTGGGCCG 200 

TATCATGAAC GTATTGGGTG AACCAATCGA CATGAAGGGC GATATCGGCG 250 

AAGAAGAACG TTGGGCGATT CACCGTCCGG CGCCAAGCTA CGAAGATTTG 300 

GCCAACTCCC AGGATCTGCT GGAAACCGGT ATCAAGGTAA TGGACCTGAT 350 

CTGCCCGTTC GCCAAGGGTG GTAAAGTCGG TCTGTTCGGT GGTGCGGGTG 400 

TTGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ATTCTGTGTT TGCGGGCGTG GGTGAGCGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGAACGACTC CAACGTACTG GACAAAGTAT 550 

CCCTGGTTTA CGGCCAGATG AACGAGCCAC CGGGTAACCG TCTGCGCGTT 600 

GCTCTGACCG GTCTGACCAT GGCTGAGAAA TTCCGTGACG AAGGCCGTGA 650 

CGTTCTGCTG TTCGTTGATA ACATCTACCG TTATACCCTG GCCGGTACCG 700 

AAGTGTCCGC ACTTCTGGGC CGTATGCCAT CGGCGGTAGG TTATCAGCCA 750 

ACGCTGGCGG AAGAGATGGG TGTTCTGCAA GAACGTATCA CCTCTACCAA 800 

GACTGGTTCA ATCACCTCCG TA 822 



2) INFORMATION FOR SEQ ID NO: 356 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia liquefaciens 

(B) STRAIN: ATCC 27592 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 56 

ATGCCGTACC AAAAGTGTAC AATGCTCTTG AGGTAGAAAA CGGTACCGAG 50 

AAGCTGGTGC TGGAAGTTCA GCAGCAGTTG GGCGGTGGCG TAGTTCGCTG 100 

TATCGCGATG GGGACCTCAG ATGGTCTGCG CCGCGGTCTG AAAGTGAACG 150 

ATCTGGAACA CCCAATTGAA GTACCGGTAG GTAAAGCTAC CCTGGGCCGT 2 00 

ATCATGAACG TATTGGGTGA ACCAATCGAC ATGAAAGGCG ACATCGGCGA 250 

AGAAGAACGT TGGGCGATTC ACCGTCCAGC GCCAAGCTAC GAAGATTTGT 300 

CAAACTCCCA GGATCTGCTG GAAACCGGTA TCAAGGTTAT GGACCTGATT 350 

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCTGGTGT 400 

TGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC GCGATCGAGC 450 
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ACTCCGGTTA TTCCGTGTTT 
AACGACTTCT ACCACGAAAT 
CCTGGTTTAC GGCCAGATGA 
CTCTGACCGG TCTGACCATG 
GTTCTGCTGT TCGTTGATAA 
AGTGTCCGCA CTTCTGGGCC 
CGCTGGCGGA AGAGATGGGC 
ACCGGTTCTA TCACTTCCG 



GCAGGCGTGG 
GAACGACTCC 
ACGAGCCACC 
GCGGAGAAAT 
CATTTACCGT 
GTATGCCATC 
GTTCTGCAAG 



GTGAGCGTAC TCGTGAGGGT 
AACGTACTGG ACAAAGTATC 
GGGTAACCGT CTGCGCGTTG 
TCCGTGACGA AGGCCGCGAC 
TATACCCTGG CCGGTACCGA 
TGCGGTAGGT TATCAGCCAA 
AACGTATCAC CTCTACCAAG 



500 
550 
600 
650 
700 
750 
800 
819 



2) INFORMATION FOR SEQ ID NO: 357 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia marcescens 

(B) STRAIN: ATCC 13880 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 57 

TCAGGATGCC GTACCGAAAG TGTACGACGC CCTTGAGGTA GAAAACGGCA 50 

CCGAAAAACT GGTGTTGGAA GTTCAGCAAC AGCTGGGCGG TGGCGTGGTT 100 

CGCTGTATCG CAATGGGGAC CTCCGACGGT CTGCGTCGCG GTCTGAAAGT 150 

GAACAACCTG GACCACCCGA TTGAAGTGCC GGTGGGTAAA GCTACCCTGG 2 00 

GTCGTATCAT GAACGTATTG GGTCAACCGA TCGACATGAA AGGCGACATC 250 

GGCGAAGAAG AGCGTTGGGC GATTCACCGC GCGGCGCCAA GCTACGAAGA 300 

GCTGTCAAGC TCTCAGGAAC TGCTGGAAAC CGGTATCAAG GTAATGGACC 350 

TGATTTGTCC GTTCGCCAAG GGCGGTAAAG TCGGTCTGTT CGGCGGTGCG 400 

GGCGTAGGTA AAAC CGTAAA CATGATGGAG CTGATCCGTA ACATCGCGAT 450 

CGAGCACTCC GGTTATTCCG TGTTTGCGGG CGTGGGCGAG CGTACTCGTG 500 

AGGGTAACGA CTTCTACCAC GAAATGACCG ACTCCAACGT TCTGGACAAA 550 

GTATCCCTGG TTTACGGCCA GATGAACGAG CCACCAGGTA ACCGTCTGCG 600 

CGTTGCGCTG ACCGGTCTGA CCATGGCGGA GAAATTCCGT GACGAAGGCC 650 

GTGACGTTCT GCTGTTCGTT GACAACATCT ACCGTTACAC CCTGGCCGGT 700 

ACCGAAGTGT CCGCACTTCT GGGCCGTATG CCATCCGCGG TAGGTTATCA 750 

GCCAACGCTG GCGGAAGAGA TGGGCGTTCT GCAAGAACGT ATCACCTCGA 800 

CCAAG 805 



2) INFORMATION FOR SEQ ID NO: 3 58 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia odor if era 

(B) STRAIN: ATCC 33077 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358 



GATGCCGTAC 
AACTCTGGTG 
GTATCGCTAT 
GATCTGCAAC 
TATCATGAAC 
AAGAAGAGCG 
TCCAACTCCC 
CTGCCCGTTT 
TTGGTAAAAC 
CACTCCGGTT 
TAACGACTTC 
CCCTGGTTTA 
GCGCTGACCG 
CGTTCTGCTG 
AAGTGTCTGC 
ACGCTGGCGG 
GACCGGTTCT 



CAAAAGTGTA 
CTGGAAGTTC 
GGGCACCTCC 
ACCCAATCGA 
GTATTGGGTC 
TTGGGCGATT 
AGGATCTGCT 
GCCAAGGGTG 
CGTAAACATG 
ATTCAGTGTT 
TACCACGAAA 
CGGCCAGATG 
GTCTGACCAT 
TTCGTTGACA 
ACTTCTGGGC 
AAGAGATGGG 
ATCACCTCCG 



CGATGCCCTT 
AGCAGCAGCT 
GACGGTTTGC 
AGTCCCGGTT 
AACCAATCGA 
CACCGCGCGG 
GGAAACCGGT 
GTAAAGTCGG 
ATGGAGCTGA 
TGCGGGCGTG 
TGACCGACTC 
AACGAGCCAC 
GGCCGAGAAA 
ACATTTACCG 
CGTATGCCAT 
CGTTCTGCAA 
TA 



GAGGTAGAAA ACGGTACCGA 
GGGCGGTGGC GTGGTTCGTT 
GTCGCGGCCT GAAAGTGAAC 
GGCAAGGCAA CGCTGGGTCG 
CATGAAAGGC GACATCGGCG 
CGCCAAGCTA CGAAGAACTG 
ATCAAGGTTA TGGACCTGAT 
TCTGTTCGGT GGTGCGGGTG 
TCCGTAACAT CGCGATCGAG 
GGTGAGCGTA CTCGTGAGGG 
CAACGTACTG GACAAGGTTT 
CGGGTAACCG TCTGCGCGTT 
TTCCGTGACG AAGGTCGTGA 
TTACACCCTG GCCGGTACCG 
CGGCGGTAGG TTATCAGCCA 
GAACGTATCA CCTCGACCAA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
822 



2) INFORMATION FOR SEQ ID NO: 359 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia plymuthica 

(B) STRAIN: ATCC 183 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359 

GTGTACAACG CTCTTGAGGT AGAAAACGGT GCCAATAAGC TGGTGCTGGA 50 

AGTTCAGCAA CAGCTGGGCG GTGGCGTGGT TCGCTGTATC GCGATGGGGA 100 

CCTCTGATGG TCTGCGTCGC GGTCTGAAAG TGATCGACCT GGATCACCCG 150 

ATTGAAGTAC CGGTAGGTAA AGCTACCCTG GGCCGTATCA TGAACGTATT 2 00 

GGGTGAACCA ATCGACATGA AAGGCGACAT CGGCGAAGAA GAACGTTGGG 250 

CAATTCACCG TCCAGCGCCA AGCTACGAAG ATTTGGCCAA CTCCCAGGAT 3 00 

CTGCTGGAAA CCGGTATCAA GGTTATGGAC CTGATCTGTC CGTTCGCTAA 3 50 

GGGCGGTAAA GTGGGTCTGT TCGGCGGTGC GGGCGTGGGT AAAACCGTAA 4 00 

ACATGATGGA GCTGATCCGT AACATCGCGA TCGAACACTC CGGTTATTCC 450 

GTGTTTGCGG GCGTGGGTGA GCGTACTCGT GAGGGTAACG ACTTCTACCA 500 

CGAAATGAAC GACTCCAACG TACTGGACAA AGTATCCCTG GTTTACGGCC 550 

AGATGAACGA GCCACCGGGT AACCGTCTGC GCGTTGCTCT. GACCGGTCTG 600 

ACCATGGCGG AGAAATTCCG TGACGAAGGC CGCGACGTTC TGCTGTTCGT 650 

TGATAACATC TACCGTTATA CCCTGGCCGG TACCGAAGTG TCCGCACTTC 700 

TGGGCCGTAT GCCATCTGCG GTAGGTTATC AGCCAACGCT GGCGGAAGAG 750 

ATGGGCGTTC TGCAAGAACG TATCACCTCT ACCAAGACCG GTTCTATCAC 800 

CTCCG 805 



2) INFORMATION FOR SEQ ID NO: 360 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

( i i ) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia rubidaea 

(B) STRAIN: ATCC 27593 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360 



15 



TTCCCTCAGG ATGCCGTACC AAAAGTGTAC GATGCCCTTG AGGTAGAGAA 50 

CGGTAACGAA AAACTGGTGC TGGAAGTTCA GCAGCAGCTG GGCGGCGGCG 100 

TTGTACGCTG TATCGCCATG GGTACTTCCG ACGGTCTGCG TCGCGGTCTG 150 

AAAGTTAACG ACCTCGAGCA CCCAATCGAA GTGCCAGTTG GTAAAGCAAC 200 

20 GCTGGGTCGT ATCATGAACG TATTGGGTCA GCCAATCGAC ATGAAAGGCG 250 

ACATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCGGC GCCAAGCTAC 300 

GAAGAGCTGT CCAGCTCCCA AGAGCTGCTG GAAACCGGTA TCAAGGTAAT 350 

GGACCTGATC TGCCCGTTCG CCAAGGGTGG TAAAGTTGGT CTGTTCGGTG 400 

GTGCGGGCGT AGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC 450 

25 GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTGG GTGAGCGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTACTGG 550 

ACAAAGTATC CCTGGTTTAC GGCCAGATGA ACGAGCCGCC GGGTAACCGT 600 

CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGATGA 650 

AGGCCGCGAC GTTCTGCTGT TCGTGGATAA CATCTACCGT TACACCCTGG 700 

3 0 CCGGTACCGA AGTGTCCGCA CTGCTCGGCC GTATGCCATC TGCGGTAGGT 750 

TATCAGCCAA CGCTGGCGGA AGAGATGGGC GTTCTGCAAG AACGTATCAC 800 

CTCGACCAAG ACCGGTTCAA TCACCTCCGT A 831 



35 



45 



2) INFORMATION FOR SEQ ID NO: 361 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 406 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paeudomonas putida 

(B) STRAIN: LCDC D7172 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 61 

CCGTAAACAT GATGGAACTG ATCCGTAACA TCGCCATCGA GCACAGCGGT 50 

TATTCCGTGT TCGCCGGTGT GGGTGAGCGT ACTCGTGAGG GTAACGACTT 100 

CTACCACGAG ATGAAGGACT CCAACGTTCT CGACAAGGTA GCGCTGGTCT 150 

55 ACGGTCAGAT GAACGAGCCA CCAGGAAACC GTCTGCGCGT AGCGCTGACC 2 00 

GGCCTGACCA TGGCCGAGAA GTTCCGTGAC GAAGGTAACG ACGTTCTGCT 2 50 

GTTCGTCGAC AACATCTATC GTTACACCCT GGCCGGTACC GAAGTATCCG 3 00 

CACTGCTGGG CCGTATGCCT TCGGCGGTAG GTTACCAGCC GACCCTGGCT 3 50 

GAAGAGATGG GCGTTCTGCA AGAACGTATC ACTTCGACCA AGGAAGGTTC 4 00 

6 0 GATCAC 4 06 
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2) INFORMATION FOR SEQ ID NO: 362 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRAND EDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella boydii 

(B) STRAIN: ATCC 92 07 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362 

TTCCCTCAGG ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTGCAAAA 50 

TGGTAATGAG CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA 100 

TCGTGCGTAC CATCGCAATG GGTTCCTCCG ACGGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ACCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCGAC 2 00 

TCTGGGCCGT ATCATGAACG TACTGGGTGA ACCGGTCGAC ATGAAAGGCG 2 50 

AGATCGGTGA AGAAGAGCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAC 3 00 

GAAGAGCTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG 4 00 

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTCAT TCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ACAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CTCTGACCGG TCTGACCATG GCTGAGAAAT TCCGTGACGA 650 

AGGTCGTGAC GTTCTGCTGT TCGTTGACAA CATCTATCGT TACACCCTGG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT 750 

TATCAGCCGA CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACCAAA ACTGGTTCTA TCACCTCCGT A 831 



2) INFORMATION FOR SEQ ID NO: 3 63 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 802 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella dysenteriae 

(B) STRAIN: ATCC 11835 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363 

GCCGTACCGC GCGTGTACGA TGCTCTTGCG GTGCAAAATG GTAATGAGCG 50 

TCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTGCGTACCA 100 

TCGCAATGGG TTCCTCCGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAC 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCGACTC TGGGCCGTAT 2 00 

CATGAACGTA CTGGGTGAAC CGGTCGACAT GAAAGGCGAG ATCGGTGAAG 2 50 
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AAGAGCGTTG GGCTATTCAC 
AACTCTCAGG AACTGCTGGA 
TCCGTTCGCT AAGGGCGGTA 
GTAAAACCGT AAACATGATG 
5 TCCGGTTACT CTGTGTTTGC 
CGACTTCTAC CACGAAATGA 
TGGTGTATGG CCAGATGAAC 
CTGACCGGTC TGACCATGGC 
TCTGCTGTTC GTTGACAACA 
10 TATCCGCACT GCTGGGCCGT 
CTGGCGGAAG AGATGGGCGT 
CG 



CGCGCAGCAC CTTCCTACGA 
AACCGGTATC AAAGTTATCG 
AAGTTGGTCT GTTCGGTGGT 
GAGCTCATTC GTAACATCGC 
GGGCGTAGGT GAACGTACTC 
CCGACTCCAA CGTTATCGAC 
GAGCCGCCGG GAAACCGTCT 
TGAGAAATTC CGTGACGAAG 
TCTATCGTTA CACCCTGGCC 
ATGCCTTCAG CGGTAGGTTA 
TCTGCAGGAA CGTATCACCT 



PCT/CAOO/01150 

AGAGCTGTCA 3 00 

ACCTGATGTG 350 

GCGGGTGTAG 4 00 

GATCGAGCAC 4 50 

GTGAGGGTAA 500 

AAAGTATCCC 550 

GCGCGTTGCT 600 

GTCGTGACGT 650 

GGTACGGAAG 700 

TCAGCCGACC 750 

CCACCAAAAC 800 

802 



15 



2 ) INFORMATION FOR SEQ ID NO: 3 64 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 819 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella flexneri 

(B) STRAIN: ATCC 12 022 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364 

ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTGCAAAA TGGTAATGAG 50 

CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA TCGTGCGTAC 100 

CATCGCAATG GGTTCCTCCG ACGGTCTGCG TCGCGGTCTG GATGTAAAAG 150 

35 ACCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCGAC TCTGGGCCGT 200 

ATCATGAACG TACTGGGTGA ACCGGTCGAC ATGAAAGGCG AGATCGGTGA 250 

AGAAGAGCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAC GAAGAGCTGT 300 

CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT CGACCTGATG 350 

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCGGGTGT 400 

40 AGGTAAAACC GTAAACATGA TGGAGCTCAT TCGTAACATC GCGATCGAGC 4 50 

ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT 500 

AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG ACAAAGTATC 550 

CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600 

CTCTGACCGG TCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

45 GTTCTGCTGT TCGTTGACAA CATCTATCGT TACACCCTGG CCGGTACGGA 700 

AGTATCCGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 750 

CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC CTCCACCAAA 800 

ACTGGTTCTA TCACCTCCG 819 

50 

2) INFORMATION FOR SEQ ID NO: 365 

( i ) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 802 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella sonnei 

(B) STRAIN: ATCC 2 993 0 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365 

GTACCGCGCG TGTACGATGC TCTTGAGGTG CAAAATGGTA ATGAGCGTCT 

GGTGCTGGAA GTTCAGCAGC AGCTCGGCGG CGGTATCGTG CGTACCATCG 

10 CAATGGGTTC CTCCGACGGT CTGCGTCGCG GTCTGGATGT AAAAGACCTC 

GAACACCCGA TCGAAGTCCC GGTAGGTAAA GCGACTCTGG GCCGTATCAT 

GAACGTACTG GGTGAACCGG TCGACATGAA AGGCGAGATC GGTGAAGAAG 

AGCGTTGGGC GATTCACCGC GCAGCACCTT CCTACGAAGA GCTGTCAAAC 

TCTCAGGAAC TGCTGGAAAC CGGTATCAAA GTTATCGACC TGATGTGTCC 

15 GTTCGCTAAG GGCGGTAAAG TTGGTCTGTT CGGTGGTGCG GGTGTAGGTA 

AAACCGTAAA CATGATGGAG CTCATTCGTA ACATCGCGAT CGAGCACTCC 

GGTTACTCTG TGTTTGCGGG CGTAGGTGAA CGTACTCGTG AGGGTAACGA 

CTTCTACCAC GAAATGACCG ACTCCAACGT TATCGACAAA GTATCCCTGG 

TGTATGGCCA GATGAACGAG CCGCCGGGAA ACCGTCTGCG CGTTGCTCTG 

20 ACCGGTCTGA CCATGGCTGA GAAATTCCGT GACGAAGGTC GTGACGTTCT 

GCTGTTCGTT GACAACATCT ATCGTTACAC CCTGGCCGGT ACGGAAGTAT 

CCGCACTGCT GGGCCGTATG CCTTCAGCGG TAGGTTATCA GCCGACCCTG 

GCGGAAGAGA TGGGCGTTCT GCAGGAACGT ATCACCTCCA CCAAAACTGG 
TT 

25 



2) INFORMATION FOR SEQ ID NO: 366 

3 0 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 785 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366 

TAACGCCTTG GTTATTGATG TGCCTAAAGA AGAAGGTACA ATACAACTAA 50 

CATTAGAAGT TGCGCTGCAA TTAGGTGACG ACGTTGTTCG TACAATTGCG 100 

45 ATGGATTCAA CTGATGGTGT CCAAAGAGGC ATGGATGTAA AAGATACAGG 150 

CAAAGAAATT AGTGTACCTG TTGGTGATGA AACATTAGGT CGTGTATTTA 2 00 

ATGTACTAGG TGAAACAATT GACCTTAAAG AAGAAATTAG TGATTCTGTT 2 50 

CGCCGCGATC CTATCCATCG TCAAGCACCA GCATTCGATG AACTTTCAAC 3 00 

AGAAGTTCAA ATTTTAGAAA CAGGTATTAA AGTAGTAGAT TTACTAGCAC 3 50 

50 CTTATATTAA AGGTGGTAAA ATCGGATTGT TCGGTGGTGC CGGTGTAGGT 4 00 

AAAACAGTAT TAATCCAAGA ATTAATTAAC AACATCGCTC AAGAGCACGG 4 50 

TGGTATTTCT GTATTCGCCG GTGTAGGTGA ACGTACTCGT GAAGGTAACG 500 

ATTTATACTT CGAAATGAGT GATAGTGGTG TAATTAAGAA AACAGCCATG 550 

GTATTCGGGC AAATGAATGA GCCACCTGGT GCACGTATGC GTGTTGCATT 600 

55 ATCTGGTTTA ACAATGGCTG AATATTTCCG TGACGAACAA GGTCAAGACG 650 

TATTATTATT CATCGATAAC ATTTTCAGAT TTACACAAGC TGGTTCTGAG 700 

GTATCTGCAT TATTAGGTCG TATGCCTTCT GCAGTAGGTT ACCAACCAAC 750 

ACTTGCTACT GAAATGGGAC AATTACAAGA ACGTA 785 

60 

219 



50 
100 
150 
200 
250 
300 
350 
400 
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2) INFORMATION FOR SEQ ID NO: 367 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 843 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus auricularis 

(B) STRAIN: ATCC 33753 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367 

GAACATAACG AAGTGCCTAA TATTAATAAC GCCTTAGTAC TCGATGTTGA 50 

AAGAGAAGAC GGAACAGTGT CTTTAACTTT AGAAGTAGCT TTACAATTAG 100 

GCGATGACGT TGTTCGTACC ATTGCAATGG ATTCAACTGA TGGTGTTAAA 150 
CGTGGTAACG AAGTCAAAGA TACTGGTAAT AGCATTAGCG TACCAGTCGG . 200 

AGACGAAACT TTAGGACGTG TCTTCAACGT TCTAGGTGAA ACAATTGATT 250 

TAGAAGATAA ACTTGATGAT TCTGCGCGAC GTGACCCTAT ACATAGAGAA 300 

GCGCCAGCGT TTGATCAATT AT C AACTCAA GTTGAAATTT TAGAAACAGG 350 

AATTAAAGTT GTTGACTTAT TAGCACCTTA TATTAAAGGT GGTAAAGTTG 400 

GACTCTTCGG TGGTGCCGGT GTTGGTAAAA CCGTTTTAAT CCAAGAATTA 450 

ATCAACAACA TCGCTCAAGA ACACGGTGGT ATTTCAGTCT TTGCCGGTGT 500 

AGGTGAACGT ACACGTGAAG GTAACGACTT GTACTATGAA ATGAGCGACA 550 

GTGGTGTAAT CAAGAAAACA GCCATGGTCT TCGGACAAAT GAACGAACCA 600 

CCTGGCGCAC GTATGCGTGT TGCTTTATCT GGTTTAACAA TGGCTGAATA 650 

TTTCCGTGAT GAACAAGGAC AAGACGTATT GTTATTCATC GACAATATTT 700 

TCCGTTTCAC ACAAGCCGGT TCAGAAGTTT CTGCCTTACT AGGTCGTTTA 750 

CCATCAGCCG TTGGTTATCA ACCTACATTA GCAACAGAAA TGGGACAATT 800 

ACAAGAACGT ATTACTTCAA CAACAAAAGG ATCAGTTACT TCA 843 



2 ) INFORMATION FOR SEQ ID NO: 368 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 84 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus capitis subsp. capitis 

(B) STRAIN: ATCC 27840 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368 

GCTTTGAACA TAATGAAGTT CCTGATATTA ACAATGCCTT ACACATCGAA 50 

GTTCCTAAAG AAGATAGCAC ACTTCATTTA ACTTTAGAAG TTGCACTTCA 100 

ATTAGGTGAC GATGTAGTAC GTACAATCGC AATGGACTCA ACTGACGGCG 150 

TTCAAAGAGG TATGGAAGTT AAAGATACAG GTAAAGATAT TAGCGTACCT 200 

GTTGGTGATG CAACTTTAGG AAGAGTATTT AACGTATTAG GAGAAACAAT 250 

CGATTTAGAT GAAAAGATTG ATGATTCAGT ACGTCGTGAT CCTATTCATA 300 

GACAGGCACC TGGCTTCGAT GAATTATCTA CTAAAGTAGA AATCTTAGAA 350 
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ACAGGTATCA AAGTAGTAGA CTTATTAGCA 
AATTGGATTA TTCGGTGGTG CCGGTGTTGG 
AACTTATCAA TAATATCGCT CAAGAGCATG 
GGTGTTGGTG AACGTACACG TGAAGGTAAC 
5 CGATAGTGGT GTAATTAAGA AAACAGCGAT 
AGCCACCTGG TGCTCGTATG CGTGTTGCAT 
GAATATTTCC GTGATGAAGA AGGCCAAGAC 
TATCTTCAGA TTCACACAAG CTGGTTCTGA 
GTATGCCTTC AGCCGTTGGT TATCAACCAA 
10 CAATTACAAG AACGTATTAG TTCAACTAAT 



PCT/CA00/01150 

CCTTACATTA AAGGTGGTAA 400" 

TAAGACAGTT TTAATCCAAG 450 

GTGGTATTTC AGTATTCGCC 500 

GACCTTTACT ATGAAATGAG 550 

GGTATTCGGT CAGATGAACG 600 

TATCAGGTTT AACAATGGCA 650 

GTATTATTAT TCATTGATAA 700 

AGTTTCAGCA TTACTTGGAC 750 

CACTTGCTAC TGAAATGGGT 800 

AAAGGTTCTG TTACTTCAA 849 



2 ) INFORMATION FOR SEQ ID NO: 369 



15 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Staphylococcus cohnii 

(B) STRAIN: DSM 20260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369 

3 0 GAAGTTCCAG AAATTAATAA TGCCTTAGTT CTCGATATAG AAAGAGAAGA 50 

AGGTACTGTT GAATTAACGT TAGAAGTTGC ATTACAACTT GGTGATGACG 100 

TAGTACGAAC AATCGCAATG GATTCAACTG ATGGTGTTAA ACGTGGTACA 150 

GAAGTTAGAG ATAGCGGAAA TAGTATCAGC GTACCAGTTG GTAATGAAAC 200 

ATTAGGTAGA GTATTTAATG TATTAGGTGA TACGATAGAT TTAGATGAAG 250 

3 5 ACATAGATGA CTCAGTGCGT CGTGACCCAA TTCATAGAGA AGCACCTGCA 3 00 

TTTGATCAGT TATCTACTAA AGTTGAAATT TTAGAAACAG GTATCAAAGT 350 

CATTGATTTA TTAGCACCAT ATATCAAAGG TGGTAAAGTT GGATTATTCG 4 00 

GTGGTGCCGG TGTTGGTAAA ACTGTATTAA TTCAAGAATT AATCAATAAT 450 

ATCGCTCAAG AGCATGGTGG TATATCCGTA TTTGCTGGTG TAGGTGAGCG 500 

40 TACGCGTGAA GGTAATGACC TATACTTTGA AATGAGTGAT AGTGGTGTTA 550 

TTAAAAAGAC AGCTATGGTA TTTGGACAAA TGAACGAACC ACCTGGTGCG 600 

CGTATGCGAG TAGCACTTTC TGGTTTAACA ATGGCTGAAT ATTTCCGGGA 650 

TGAACAAGGA CAAGATGTTC TATTATTCAT AGATAACATC TTTAGATTTA 700 

CTCAAGCTGG TTCAGAAGTT TCTGCGTTAT TAGGTCGTAT GCCTTCAGCT 750 

4 5 GTTGGTTACC AACCAACGTT AGCAAC TGAA ATGGGACAAT TACAAGAACG 800 

TATTACTTCT ACAACTAAAG GTTCAGTAAC 830 



50 2) INFORMATION FOR SEQ ID NO: 370 

( i ) SEQUENCE CHARACTER I STICS : 

(A) LENGTH: 787 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Staphylococcus epidermid'Ste 

(B) STRAIN: ATCC 14990 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370 

5 

AATAATGCAT TACACATCGA AGTTCCTAAA GAAGATGGAG CGCTTCAATT 50 

AACATTAGAA GTTGCACTTC AACTAGGTGA CGATGTAGTT CGTACAATTG 100 

CAATGGACTC AACTGACGGC GTTCAAAGAG GAATGGAAGT TAAAGATACA 150 

GGTAGAGACA TAAGTGTACC TGTCGGTGAC GTAACTCTAG GAAGAGTGTT 2 00 

10 TAACGTACTA GGAGAAACTA TTGACTTAGA TGAAAAAATT GATGATTCAG 250 

TACGACGTGA CCCTATCCAT AGACAAGCTC CAGGATTCGA CGAATTATCA 300 

ACAAAAGTAG AAATCTTAGA AACTGGTATT AAAGTAGTAG ACTTATTAGC 350 

ACCTTACATA AAAGGTGGTA AAATTGGATT ATTTGGTGGT GCCGGTGTAG 4 00 

GTAAAACCGT ACTAATCCAA GAACTTATTA ATAACATCGC TCAAGAACAC 450 

15 GGTGGTATCT CAGTATTCGC TGGTGTTGGT GAACGTACAC GTGAAGGTAA 500 

TGATCTTTAC TATGAAATGA GTGACAGTGG TGTTATCAAG AAAACTGCAA 550 

TGGTCTTTGG TCAAATGAAT GAGCCACCTG GTGCACGTAT GCGTGTAGCA 600 

TTATCCGGAT TAACAATGGC CGAATATTTC CGAGATGAAG AAGGCCAAGA 650 

TGTGTTATTA TTCATTGATA ACATTTTCAG ATTCACTCAA GCTGGTTCAG 700 

20 AAGTTTCTGC GTTATTAGGT CGTATGCCAT CAGCTGTTGG TTATCAACCT 750 

ACACTTGCTA CAGAAATGGG TCAATTACAA GAACGTA 7 87 



25 2) INFORMATION FOR SEQ ID NO: 371 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: ATCC 29970 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371 



GAAGTACCTG AAATTAATAA CGCCTTAATC ATCGAAGTTC CCAAAGAAGA 50 

TGGTACTTTT GAATTAACGC TTGAAGTTGC ATTACAACTA GGTGATGACG 100 

TTGTTCGTAC AATTGCTATG GATTCAACAG ATGGTGTTCA ACGTGGTATG 150 

GAAGTTCAGA ACACTGGAAA AGACATTTCA GTACCAGTTG GCGAAGTAAC 200 

45 TTTAGGACGT GTATTTAACG TATTAGGTGA CACAATTGAT TTAGAAGATA 250 

AATTAGATGG TTCAGTAAGA CGTGATCCAA TTCATAGACA ATCACCTAAC 3 00 

TTTGACGAAT TATCTACTGA AGTAGAAATT CTTGAAACTG GAATCAAAGT 3 50 

TGTAGACTTA TTAGCACCAT ACATCAAAGG TGGTAAAATC GGTCTATTTG 400 

GTGGTGCCGG TGTTGGTAAA ACCGTTTTAA TCCAAGAATT GATTAATAAT 450 

50 ATCGCACAAG AACATGGTGG TATCTCAGTA TTTGCTGGTG TAGGTGAACG 500 

TACACGTGAA GGTAACGACC TATATTATGA AATGAGAGAT AGTGGTGTTA 550 

TTAAGAAAAC AGCAATGGTA TTTGGTCAAA TGAACGAGCC ACCTGGTGCA 600 

CGTATGCGTG TGGCACTTTC TGCATTGACA ATGGCTGAGT ATTTCCGTGA 650 

TGAACAAGGA CAAGACGTTC TGTTATTCAT CGATAACATT TTCAGATTTA 700 

55 CTCAAGCAGG TTCAGAAGTA TCAGCATTAT TGGGACGTAT GCCTTCAGCT 750 

GTAGGTTATC AACCTACTTT AGCTACAGAA ATGGGTCAAT TACAAGAACG 800 

TATTACATCA ACGAATAAAG GTTCAGTAAC 83 0 



60 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



( i i ) MOLECULE TYPE: Genomic DNA 

10 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis subsp. hominis 

(B) STRAIN: ATCC 27844 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372 



TCGAACATAA TGAAGTCCCT GAAATTAATA ATGCCCTAAT TATTGAAGTA 50 

CCCAAAAATG ATGGCACATT TAAATTAACA TTAGAAGTTG CATTGCAACT 100 

AGGTGATGAT GTTGTTCGTA CTATTGCAAT GGATTCAACT GATGGTGTTC 150 

20 AACGTGGTAT GCAAGTTGTG AATACTGGTA AAGATATTAG TGTTCCTGTA 200 

GGTGAAGAAA CACTTGGACG TGTGTTTAAC GTTTTAGGAG AAACAATAGA 250 

TTTAAACGAA AAAATAGATA GTTCTGTTAG ACGTGATCCA ATTCATCGTC 300 

GTCAAC CTAA TTTTGATGAA TTATCTACTG AAGTAGAAAT TCTTGAAACA 350 

GGTATTAAAG TTGTAGACTT ATTAGCACCT TATATTAAAG GTGGTAAGAT 400 

25 TGGTTTATTC GGTGGTGCCG GCGTAGGTAA AACTGTATTA ATTCAAGAAT 450 

TAATCAATAA TATCGCTCAA GAACATGGTG GTATTTCTGT ATTCGCTGGT 500 

GTAGGTGAAC GTACTCGTGA AGGTAACGAT TTATACTATG AAATGAGCGA 550 

TAGTGGCGTT ATCAATAAAA CAGCCATGGT ATTTGGGCAA ATGAATGAGC 600 

CGCCAGGTGC GCGTATGCGT GTTGCTTTAT CAGCATTGAC AATGGCTGAA 650 

30 TATTTCCGTG ATGAACAAGG TCAAGATGTA CTTTTATTCA TTGACAATAT 700 

TTTCCGCTTT ACTCAAGCTG GTTCTGAAGT TTCAGCATTA TTAGGACGTA 750 

TGCCTTCAGC TGTAGGTTAT CAACCTACAT TAGCAACTGA AATGGGTCAA 800 

TTACAAGAAC GTATTACATC TACTAATAAA GGTTCAGTCA CTTCAA 846 



35 



2) INFORMATION FOR SEQ ID NO: 373 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 84 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 

(B) STRAIN: CSG 175 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 73 



TCGAACATAA TGAAGTCCCT GAAATTAATA ATGCCCTAAT TATTGAAGTA 50 

CCCAAAAATG ATGGCACATT CAAATTAACA TTAGAAGTTG CATTGCAACT 100 

55 AGGTGATGAT GTTGTTCGTA CTATTGCAAT GGATTCAACT GATGGTGTTC 150 

AACGTGGTAT GCAAGTTGTG AATACTGGTA AAGATATTAG TGTTCCTGTA 2 00 

GGTGAAGAAA CACTTGGACG TGTGTTTAAC GTTTTAGGAG AAACAATAGA 250 

TTTAAACGAA AAAATAGATA GTTCTGTTAG ACGTGATCCA ATTCATCGTC 300 

GTCAACCTAA TTTTGATGAA TTATCTACTG AAGTAGAAAT TCTTGAAACA 3 50 

60 GGTATTAAAG TTGTAGACTT ATTAGCACCT TATATTAAAG GTGGTAAGAT 4 00 
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TGGTTTATTC GGTGGTGCCG 
TAATCAATAA TATCGCTCAA 
GTAGGTGAAC GTACTCGTGA 
TAGTGGCGTT ATCAATAAAA 
5 CGCCAGGTGC GCGTATGCGT 
TATTTCCGTG ATGAACAAGG 
TTTCCGCTTT ACTCAAGCTG 
TGCCTTCAGC TGTAGGTTAT 
TTACAAGAAC GTATTACATC 



GCGTAGGTAA AACTGTATTA 
GAACATGGTG GTATTTCTGT 
AGGTAACGAT TTATACTATG 
CAGCCATGGT ATTTGGGCAA 
GTTGCTTTAT CAGCATTGAC 
TCAAGATGTA CTTTTATTCA 
GTTCTGAAGT TTCAGCATTA 
CAACCTACAT TAGCAACTGA 
TACTAATAAA GGTTCAGTCA 



PCT/CA00/01150 

ATTCAAGAAT 450 

ATTCGCTGGT 500 

AAATGAGCGA 550 

ATGAATGAGC 600 

AATGGCTGAA 650 

TTGACAATAT 700 

TTAGGACGTA 750 

AATGGGTCAA 800 

CTTCAA 846 



2) INFORMATION FOR SEQ ID NO: 374 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus lugdunensis 
25 (B) STRAIN: ATCC 43809 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374 

ATAATGAAGT GCCTGAAATA AATAATGCGC TCATTGTTGA AATTCCTAAA 50 

3 0 AGTGATACAA CAATCAGTTT AACACTTGAA GTTGCTTTGC AATTAGGTGA 100 

CGATGTTGTA CGTACTATTG CAATGGATTC AACTGATGGC GTTCAACGTG 150 

GTATGGAAGT TCAAAACACA GGTAAAGACA TCAGTGTACC TGTTGGAGAT 200 

GAAACATTAG GAAGAGTATT TAACGTTTTA GGAGAATCTA TTGATTTAGA 250 

AGAAAAGCTA GATGACTCTG TGCGTAGAGA TCCAATTCAT AGACTAGCAC 3 00 

35 CTAAATTTGA TGAATTATCT ACAGAAGTAG AAATTCTTGA AACTGGTATT 350 

AAAGTTGTTG ATTTATTAGC ACCATATATT AAAGGTGGTA AAGTTGGATT 4 00 

GTTTGGTGGT GCCGGAGTAG GTAAAACGGT ATTAATTCAA GAATTAATCA 4 50 

ACAATATTGC TCAAGAACAT GGTGGTATTT CTGTGTTTGC CGGAGTAGGT 500 

GAACGTACAC GTGAAGGTAA TGACTTATAT TATGAAATGA GCGATAGTGG 550 

40 CGTAATTAAG AAAACAGCGA TGGTATTTGG CCAAATGAAT GAACCACCTG 600 

GTGCACGTAT GAGAGTTGCG TTATCTGCCT TAACAATGGC TGAATATTTC 650 

CGTGACGAGC AAGGACAAGA CGTATTGCTG TTTATCGATA ATATATTCCG 700 

TTTTACACAA GCAGGTTCAG AAGTATCTGC ATTACTTGGA CGTATGCCAT 750 

CTGCCGTTGG TTATCAACCA ACATTGGCTA CAGAAATGGG ACAATTGCAA 800 

45 GAAAGAATTA CATCTACAAA TAAAGGTTCT GTAAC 835 



2) INFORMATION FOR SEQ ID NO: 375 



50 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 842 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Staphylococcus saprophyticus 
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(B) STRAIN: ATCC 15305 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375 

GAGCACAATG AAGTTCCAGA AATTAACAAT GCCTTAGTCG TAGACGTTGA 50 

AAGAGATGAA GGTACAGTAT CTCTTACATT AGAAGTGGCA TTACAACTTG 100 

GCGATGATGT CGTACGTACA ATTGCAATGG ATTCTACTGA TGGTGTTAAA 150 

CGTGGTACAG AAGTTCGAGA TAGCGGAGAT AGCATCAGTG TTCCAGTTGG 2 00 

TGATGCTACG TTAGGACGTG TGTTTAATGT TCTTGGTGAT ACAATTGACT 250 

TAGACGAGAA GCTTGATACT TCTGTCAAAC GTGATCCAAT TCATAGAGAA 300 

GCACCTGCAT TCGATCAATT ATCAACAAAA GTTGAAATCT TAGAAACAGG 350 

TATTAAAGTA ATTGATTTAC TTGCACCATA TATTAAAGGT GGTAAAATCG 4 00 

GTTTATTCGG TGGCGCTGGT GTAGGTAAAA CAGTATTAAT TCAAGAATTA 450 

ATTAATAATA TAGCTCAAGA ACATGGTGGT ATTTCAGTAT TTGCCGGCGT 500 

AGGTGAACGT ACGCGTGAAG GTAATGACTT ATACTACGAA ATGAGTGATA 550 

GTGGTGTTAT TAAGAAAACA GCTATGGTCT TCGGACAAAT GAATGAGCCA 600 

CCTGGTGCGC GTATGCGTGT TGCTTTATCA GGCTTAACAA TGGCTGAACA 650 

CTTCCGTGAT GTACAAGGAC AAGATGTTTT ACTATTTATT GATAACATAT 700 

TCAGATTTAC GCAAGCTGGT TCAGAAGTAT CAGCACTATT AGGTCGTATG 750 

CCATCAGCCG TTGGTTATCA ACCTACCCTT GCTACTGAAA TGGGTCAATT 800 

ACAAGAACGT ATTACATCAA CAACTAAAGG ATCTGTAACG TC 842 



2) INFORMATION FOR SEQ ID NO: 376 

( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 842 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus simulans 

(B) STRAIN: ATCC 27848 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376 

TGATGAACTG CCTAAGATTA ATAACGCATT AGTGCTAGAT GTACCTAAGA 50 

AAGATGGCAC GACTGAATCT CTTACATTAG AAGTAGCACT TGAATTAGGC 100 

GACGACGTAG TTAGAACTAT CGCCATGGAC TCTACAGACG GAATTAAACG 150 

TGGTGACGAC GTTAAAGACA CTGGTCGTCC AATCAGTGTA CCTGTCGGTG 200 

AAGATACGTT AGGAAGAGTA TTTAACGTTT TAGGTGATCC AATCGATAAT 250 

GATGGACCGA TTTCTGAATC AGTTCCACGT GAACCAATTC ATAGACAACC 300 

ACCTAAATTT GATGAATTAT CAACAAAAGT TGAACTACTT GAAACTGGTA 350 

TCAAAGTAGT AGACTTATTA GCACCATATA TCAAAGGTGG TAAAGTTGGT 40 0 

TTATTCGGTG GTGCCGGAGT AGGTAAAACT GTATTAATCC AAGAATTAAT 450 

TAATAACATC GCTCAAGAAC ACGGCGGTAT TTCAGTATTC GCAGGTGTTG 500 

GTGAACGTAC ACGTGAAGGT AACGACTTGT ACTTCGAAAT GAGCGACAGT 550 

GGTGTTATCA AGAAAACAGC GATGGTATTC GGACAAATGA ACGAACCACC 600 

TGGTGCACGT ATGCGTGTAG CTTTATCAGG TTTAACAATG GCTGAATACT 650 

TCCGTGATGT TAAAGGACAA GACGTTCTTT TATTCATCGA TAACATTTTC 70 0 

CGCTTCACAC AAGCAGGTTC TGAGGTATCA GCATTGCTTG GCCGTATGCC 750 

ATCAGCCGTT GGTTACCAAC CAACATTGGC AACAGAAATG GGTCAATTAC 800 

AAGAACGTAT CACTTCTACA ATGAAAGGTT CTATCACATC TA 842 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 

(B) STRAIN: ATCC 27836 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377 

CATAACGAAG TCCCTGATAT TAATAATGCC CTTATTATTG AAGTTCCAAA 50 

AGAAGATGGA ACGTTAAACT TAACATTAGA AGTTGCACTA CAATTAGGTG 100 

ATGATGTTGT ACGTACAATT GCAATGGATT CAAC TGATGG TGTTCAAAGA 150 

GGCATGGATG TTAAAGACAC AGGTAAAGAT ATTAGTGTAC CTGTAGGCGA 200 

TGAAACGCTT GGAAGAGTGT TTAATGTACT AGGTGAAACA ATTGACTTGG 250 

AAGAGAAAAT TGATGATTCC GTACGTCGTG ATCCAATCCA TAGACAATCA 300 

CCAGGTTTCG ATGAATTATC TACTGAAGTA GAAATCTTAG AAACAGGTAT 350 

TAAAGTAGTA GACTTATTAG CACCTTACAT TAAAGGTGGT AAAGTTGGAC 400 

TATTCGGTGG TGCCGGAGTA GGTAAAACCG TTTTAATCCA AGAATTAATT 450 

AACAATATTG CACAAGAACA TGGTGGTATT TCAGTATTCG CGGGTGTAGG 500 

TGAACGTACT CGTGAAGGTA ATGATTTATA CTATGAAATG AGTGATAGTG 550 

GTGTAATTAA GAAAACAGCG ATGGTATTTG GACAAATGAA TGAACCACCT 600 

GGCGCACGTA TGCGTGTAGC TTTATCTGGT TTAACTATGG CTGAATACTT 650 

CCGTGATGAA CAAGGACAAG ACGTACTTTT ATTCATCGAT AATATTTTCA 700 

GATTTACACA AGCTGGTTCT GAAGTTTCTG CATTACTTGG TCGTATGCCT 750 

TCAGCCGTTG GTTACCAACC AACATTAGCA ACTGAAATGG GTCAATTACA 800 

AGAACGAATT ACATCTACAA ATAAAGGTTC TGTAACATCT A 841 



2) INFORMATION FOR SEQ ID NO: 378 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus acidominimus 

(B) STRAIN: ATCC 51726 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378 



TTTAACACGA ATGAACCGCT TCCTGAGATA AATAATG CAC 
CAAAGACAGT GAGAAAAAAC ATAAAATCGT TCTTGAAGTA 
TTGGTGAAGG CCTCGTTCGT ACCATTGCTA TGGAATCAAC 
ACACGTGGTC TAGAAGTTCT TGATACAGGC CGTGCAATCA 
TGGTAAAGAA ACGCTTGGAC GTGTCTTCAA CGTTCTTGGT 
ATC TTGAAGA ACCATTTGGA GAAGATGCAG AACGTCACCC 
AGTGCCCCAA CTTTTGATGA ATTATCAACG TCAACAGAAA 
AGGGATTAAA GTTATCGACC TACTTGCCCC TTACTTAAAA 



TTGTTGTTTA 
GCTCTTGAAC 
TGATGGTTTG 
GTGTACCAGT 
GATGCTATCG 
CATTCATAAG 
TCCTTGAAAC 
GGAGGGAAGG 



50 
100 
150 
200 
250 
300 
350 
400 
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TTGGACTTTT CGGTGGTGCC GGAGTTGGTA 
TTGATTCATA ACATTGCTCA AGAGCATGGT 
AGTTGGTGAA CGTACACGTG AAGGTAATGA 
AATCAGGCGT TATTGAAAAA ACAGCTATGG 
5 CCACCTGGTG CACGTATGCG TGTAGCCCTT 
ATATTTCCGT GATGTTGAAG GACAGGACGT 
TTTTTCGTTT CACACAAGCA GGTTCTGAAG 
ATGCCATCAG CCGTTGGTTA TCAACCAACC 
ATTGCAAGAA CGTATCACGT CAACTAAAAA 



PCT/CA00/01150 

AGACCGTTCT TATCCAAGAG" 45T) 

GGTATTTCAG TATTTACCGG 500 

CCTCTATTGG GAAATGAAAG 550 

TATTTGGTCA GATGAATGAG 600 

ACTGGTTTGA CAATCGCTGA 650 

GCTTCTCTTT ATTGATAACA 700 

TTTCAGCTCT TCTTGGACGT 750 

TTGGCAACTG AAATGGGTCA 8 00 

AGGTTCTGTT ACATCA 846 



2) INFORMATION FOR SEQ ID NO: 379 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 
25 (B) STRAIN: ATCC 12403 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379 

TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 50 

3 0 AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTA CTCTTGAACT 100 

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

CACGTGGTTT GGAAGTATTA GATACTGGTC GCGCAATTAG TGTGCCGGTT 200 

GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 250 

CCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 300 

3 5 AAGCACCATC GTTTGATGAA TTATCAACAT CAT C AG AAAT CTTAGAAACA 350 

GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 400 

TGGACTTTTC GGTGGTGCAG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 4 50 

TAATCCACAA CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTACTGGT 500 

GTAGGAGAAC GTACTCGTGA AGGGAATGAC CTTTATTGGG AAATGAAAGA 550 

4 0 ATCTGGCGTT ATTGAAAAAA CGGCTATGGT CTTTGGTCAA ATGAATGAAC 600 

CACCAGGAGC ACGTATGCGT GTGGCACTTA CTGGTCTTAC AATAGCTGAG 650 

TACTTCCGTG ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 700 

CTTCCGTTTC ACACAAGCTG GGTCAGAAGT GTCAGCGCTT TTAGGTCGTA 750 

TGCCTTCAGC CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 800 

45 TTACAAGAGC GTATCACTTC AACTAAAAAA GGTTCTGTTA CCTCAA 846 



2) INFORMATION FOR SEQ ID NO: 380 

50 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Streptococcus agalactiae 
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(B) 



STRAIN: ATCC 13813 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380 



TTGCAAGTGG 
AAAAATGGCG 
TGGTGACGGC 
CACGTGGTTT 
GGTAAGGATA 
TCTTGAAGAG 
AAGCACCATC 
GGTATTAAAG 
TGGACTTTTC 
TAATCCACAA 
GTAGGAGAAC 
ATCTGGCGTT 
CACCAGGAGC 
TACTTCCGTG 
CTTCCGTTTC 
TGCCTTCAGC 
TTACAAGAGC 



CGACAAACTT 
ATAAGTCACA 
CTCGTTCGTA 
GGAAGTATTA 
CTTTGGGTCG 
CCTTTTGCAG 
GTTTGATGAA 
TTATTGACTT 
GGTGGTGCGG 
CATCGCCCAA 
GTACTCGTGA 
ATTGAAAAAA 
ACGTATGCGT 
ATGTAGAAGG 
ACACAAGCTG 
CGTTGGTTAT 
GTATCACTTC 



CCTGAGATTA 
AAAAGTAGTA 
CAATCGCTAT 
GATACTGGTC 
TGTCTTCAAC 
AAGATGCAGA 
TTATCAACAT 
ATTAGCACCT 
GTGTTGGTAA 
GAACATGGTG 
AGGGAATGAC 
CGGCTATGGT 
GTGGCACTTA 
ACAAGATGTG 
GGTCAGAAGT 
CAACCAACAC 
AACTAAAAAA 



ATAATGCATT 
CTTGAAGTTG 
GGAATCAACT 
GCGCAATTAG 
GTTCTTGGAG 
ACGTCAACCA 
CATCAGAAAT 
TACTTAAAAG 
AACCGTTCTT 
GTATTTCAGT 
CTTTATTGGG 
CTTTGGTCAA 
CTGGTCTTAC 
CTTCTCTTCA 
GTCAGCGCTT 
TTGCTACAGA 
GGTTCTGTTA 



GATTGTCTAT 
CTCTTGAACT 
GATGGGCTTA 
TGTGCCGGTT 
ATGCTATTGA 
ATC CAT AAAA 
CTTAGAAACT 
GTGGTAAAGT 
ATTCAAGAAT 
ATTTACTGGT 
AAATGAAAGA 
ATGAATGAAC 
AATAGCTGAG 
TTGATAATAT 
TTAGGTCGTA 
AATGGGACAA 
CCTCAA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
846 



2) INFORMATION FOR SEQ ID NO: 381 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 12973 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381 

TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 50 

AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTG CTCTTGAACT 100 

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

CACGTGGTTT GGAAGTATTA GATACTGGTC GTGCAATTAG TGTGCCGGTT 200 

GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 250 

CCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 3 00 

AAGCACCATC ATTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACA 350 

GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 4 00 

TGGACTTTTC GGTGGTGCGG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 450 

TAATCCACAA CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTACTGGT 500 

GTAGGAGAAC GTACTCGTGA AGGGAATGAC CTTTATTGGG AAATGAAAGA 550 

ATCTGGCGTT ATTGAAAAAA CGGCTATGGT CTTTGGTCAA ATGAATGAAC 600 

CACCAGGAGC ACGTATGCGT GTGGCACTTA CTGGTCTTAC AATAGCTGAG 650 

TACTTCCGTG ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 700 

CTTCCGTTTC ACACAAGCTG GGTCAGAAGT GTCAGCGCTT TTAGGTCGTA 750 

TGCCTTCAGC CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 800 

TTACAAGAGC GTATCACTTC AACTAAAAAA GGTTCTGTTA CCTCA 845 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 27591 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382 

TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 50 

AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTG CTCTTGAACT 100 

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

CACGTGGTTT GGAAGTATTA GATACTGGTC GCGCAATTAG TGTGCCGGTT 200 

GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 2 50 

CCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 3 00 

AAGCACCATC GTTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACT 350 

GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 400 

TGGACTTTTC GGTGGTGCAG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 450 

TAATCCACAA CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTACTGGT 500 

GTAGGAGAAC GTACTCGTGA AGGGAATGAC CTTTATTGGG AAATGAAAGA 550 

ATCTGGCGTT ATTGAAAAAA CGGCTATGGT CTTTGGTCAA ATGAATGAAC 600 

CACCAGGAGC ACGTATGCGT GTGGCACTTA CTGGTCTTAC AATAGCTGAG 650 

TACTTCCGTG ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 700 

CTTCCGTTTC ACACAAGCTG GGTCAGAAGT GTCAGCGCTT TTAGGTCGTA 750 

TGCCTTCAGC CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 800 

TTACAAGAGC GTATCACTTC AACTAAAAAA GGTTCTGTTA CATCA 845 



2) INFORMATION FOR SEQ ID NO: 383 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: CDC ssl073 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 83 

TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 50 

AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTG CTCTTGAACT 100 

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

CACGTGGTTT GGAAGTATTA GATACTGGTC GCGCAATTAG TGTGCCGGTT 200 

GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 250 

CCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 300 

AAGCACCATC GTTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACT 350 

GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 400 
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TGGACTTTTC GGTGGTGCAG GTGTTGGTAA 
TAATCCACAA CATCGCCCAA GAACATGGTG 
GTAGGAGAAC GTACTCGTGA AGGGAATGAC 
ATCTGGCGTT ATTGAAAAAA CGGCTATGGT 
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CTTCCGTTTC ACACAAGCTG GGTCAGAAGT 
TGCCTTCAGC CGTTGGTTAT CAACCAACAC 
TTACAAGAGC GTATCACTTC AACTAAAAAA 



PCT/CA00/01150 

AACCGTTCTT ATTCAAGAA^ 450 

GTATTTCAGT ATTTACTGGT 500 

CTTTATTGGG AAATGAAAGA 550 

CTTTGGTCAA ATGAATGAAC 600 

CTGGTCTTAC AATAGCTGAG 650 

CTTCTCTTCA TTGATAATAT 700 

GTCAGCGCTT TTAGGTCGTA 750 

TTGCTACAGA AATGGGACAA 800 

GGTTCTGTTA CATC A 845 



2) INFORMATION FOR SEQ ID NO: 384 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

20 

( i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus dysgalactiae 
25 (B) STRAIN: ATCC 43078 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384 

TTGCTAGTGG GGACAAACTT CCAGAGATTA ATAATGCATT GATTGTTTAT 50 

30 AAAGATAGTG ATAAAAAGCA AAAAATCGTC CTTGAAGTTG CTCTGGAACT 100 

TGGTGACGGT ATGGTGCGAA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

CACGTGGGTT AGAAGTTCTT GACACTGGTC GTGCGATTAG TGTACCAGTA 2 00 

GGTAAAGAAA CTTTGGGACG CGTCTTTAAT GTACTTGGAG AAACCATTGA 250 

CTTGGAAGAA CCATTTGCAG AAGACGTTGA CCGTCAGCCA ATCCATAAAA 3 00 

35 AAGCACCATC GTTTGATGAA TTATCAACAT CATCAGAAAT TCTTGAAACT 350 

GGTATCAAGG TAATTGACCT TCTTGCCCCT TACCTTAAAG GTGGTAAAGT 400 

TGGACTTTTC GGGGGTGCCG GAGTTGGTAA GACTGTCCTT ATCCAAGAAT 4 50 

TAATCCACAA TATCGCCCAA GAACACGGAG GTATTTCAGT ATTTACCGGT 500 

GTTGGTGAGC GAACACGTGA AGGAAATGAC CTTTACTGGG AAATGAAAGA 550 

40 ATCAGGCGTT ATTGAGAAAA CTGCCATGGT TTTTGGTCAG ATGAATGAGC 600 

CGCCTGGGGC ACGTATGCGT GTAGCCCTTA CTGGTTTAAC CATTGCTGAG 650 

TATTTCCGTG ATGTAGAAGG CCAAGATGTT TTGCTCTTTA TTGATAATAT 700 

CTTCCGTTTC ACT CAGGCAG GTTCAGAAGT ATCAGCCCTC TTAGGCCGTA 750 

TGCCTTCTGC TGTTGGTTAC CAACCGACCC TTGCTACTGA AATGGGACAA 800 

45 TTGCAAGAAC GTATTACGTC AACTCAAAAA GGATCTGTTA CTTCT 845 



2) INFORMATION FOR SEQ ID NO: 385 



50 



( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Streptococcus equi subsp. egui 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385 

5 TTGCGAGTGG GGACAAACTA CCAGAGATTA ATAATGCGTT GATAGTTTAT 50 

AAAGATGGCG ATAAAAAGCA AAAAATCGTT CTCGAGGTTG CCCTAGAGCT 100 

TGGAGACGGT ATGGTACGTA CAATTGCTAT GGAATCAACC GATGGGCTTA 150 

CACGTGGATT AGAGGTTCTT GATACTGGTC GTGCCATTAG TGTACCAGTT 2 00 

GGTAAAGAGA CTCTAGGTCG TGTTTTCAAC GTTCTTGGTG AAACCATCGA 250 

10 CCTAGAAGCA CCATTTGCAG ATGATGTTAA TCGTGAACCG ATCCATAAAA 300 

AAGCACCAGC CTTTGATGAA TTGTCAACAT CATCAGAAAT TCTTGAAACA 350 

GGTATCAAGG TTATTGACCT GCTTGCCCCT TACTTAAAGG GTGGTAAGGT 4 00 

CGGTCTTTTC GGTGGTGCCG GAGTTGGTAA AACCGTTCTT ATCCAAGAAT 450 

TAATCCACAA TATCGCTCAA GAGCATGGTG GGATCTCGGT ATTTACCGGT 500 

15 GTTGGTGAGC GTACGCGTGA AGGAAATGAC CTTTACTGGG AAATGAAGGA 550 

ATCAGGCGTT ATTGAAAAAA CAGCCATGGT TTTTGGTCAG ATGAATGAAC 600 

CACCAGGAGC CCGTATGCGT GTTGCCTTGA CCGGCTTGAC AATTGCTGAA 650 

TATTTCCGCG ATGTTGAAGG CCAAGACGTC CTGCTCTTCA TTGACAATAT 700 

TTTCCGCTTT ACTCAAGCAG GCTCAGAGGT ATCAGCCCTT CTAGGTCGTA 750 

20 TGCCTTCAGC CGTTGGTTAC CAGCCAACAC TTGCCACTGA AATGGGACAA 800 

TTGCAAGAGC GTATCACCTC AACGAAAAAA GGCTCTGTTA CCTCTA 84 6 



25 2 ) INFORMATION FOR SEQ ID NO: 386 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 6 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus anginosus 

(B) STRAIN: ATCC 27335 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386 



TTGCAGCTGG TGATAAATTA CCTGAGATAA ATAATGCACT TGTAGTCTAT 50 

AAAAATGACG AAAATAAATC AAAAATCGTC CTTGAAGTAG CTCTTGAGCT 100 

TGGTGATGGA GTGGTTCGAA CTATTGCCAT GGAATCCACT GATGGGTTGA 150 

CTCGTGGCAT GGAAGTTCTA GATACTGGTC GACCAATTTC TGTTCCAGTT 200 

45 GGGAAAGAAA CACTTGGTCG CGTCTTTAAC GTTTTAGGCG ATACCATTGA 250 

TTTGGATACT CCATTCGGCG AAGATGCAGA ACGTCAACCA ATCCATAAAA 3 00 

AAGCTCCAAC TTTTGATGAG TTATCTACTT CATCAGAAAT CTTAGAAACA 3 50 

GGAATAAAGG TTATTGACCT TTTAGCCCCC TACCTCAAAG GTGGGAAAGT 400 

CGGCCTCTTC GGTGGTGCTG GCGTTGGGAA AACTGTCTTG ATTCAAGAGT 450 

50 TGATTCATAA TATCGCCCAA GAACACGGCG GGATTTCAGT CTTTACTGGT 500 

GTTGGGGAAC GAACTCGTGA AGGGAATGAC CTGTACTGGG AAATGAAAGA 550 

ATCTGGTGTT ATCGAAAAGA CGGCTATGGT CTTTGGGCAA ATGAATGAAC 600 

CGCCTGGAGC ACGTATGCGT GTAGCTTTGA CTGGGTTAAC GATTGCAGAG 650 

TATTTCCGTG ATGTGGAAGG TCAAGATGTT CTTTTGTTTA TTGATAATAT 700 

55 TTTCCGTTTC ACTCAAGCTG GTTCTGAAGT GTCAGCCCTT CTTGGTCGTA 750 

TGCCATCAGC TGTTGGTTAC CAACCAACCT TGGCTACTGA AATGGGGCAA 800 

TTACAAGAAC GTATTACATC AACGAAAAAA GGTTCTGTTA CCTCAA 84 6 



60 
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10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 843 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus salivarius 

(B) STRAIN: ATCC 7073 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387 

GCAG'CTGGTG ATAAACTTCC TGAGATTAAC AATGCATTGG TCGTTTATAC 50 

TGATGAACAA AAGTCTAAAC GTATCGTGCT CGAAGTAGCT CTTGAACTTG 100 

GAGAAGGTGT GGTTCGTACC ATTGCCATGG AATCTACTGA TGGATTGACT 150 

2 0 CGTGGACTAG AAGTTCTGGA CACTGGTCGT CCAATCAGCG TTCCTGTTGG 200 

TAAAGATACC CTTGGACGTG TCTTTAACGT TCTTGGTGAT ACCATTGACT 250 

TGGAAGCACC TTTTGCAGAC GATGCAGAGC GTGAACCAAT TCACAAAAAA 300 

GCACCAACTT TCGATGAATT GTCAACATCT ACTGAAATCC TTGAAACAGG 3 50 

GATTAAAGTT ATCGACTTGC TAGCCCCTTA CCTTAAGGGT GGTAAAGTCG 400 

25 GACTCTTCGG TGGTGCCGGT GTTGGTAAAA CCGTTCTTAT TCAAGAGTTG 450 

ATTCACAACA TTGCCCAAGA GCACGGTGGT ATTTCCGTGT TTACAGGTGT 500 

TGGTGAACGT ACACGTGAAG GTAATGACCT TTACTGGGAA ATGAAAGAAT 550 

CTGGCGTTAT CGAGAAAACA GCCATGGTCT TCGGTCAAAT GAACGAACCA 600 

CCTGGAGCAC GTATGCGTGT TGCCCTTACT GGTTTGACAA TTGCGGAATA 650 

3 0 CTTCCGTGAT GTCGAGGGTC AAGACGTTCT TCTCTTCATC GATAACATCT 700 

TCCGTTTCAC TCAAGCAGGT TCTGAGGTTT CTGCCCTTCT TGGTCGTATG 750 
CCATCAGCCG TTGGTTACCA ACCTACACTT GCTACTGAAA TGGGTCAATT 800 
GCAAGAACGT ATCACATCAA CTAAAAAAGG TTCTGTTACA TCT 843 



35 



50 



2) INFORMATION FOR SEQ ID NO: 388 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus suis 

(B) STRAIN: ATCC 43765 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 88 



TTGCAGCAGA AGATAAACTT CCTGAGATTA ACAACGCACT CGTTGTATAT 50 

AAAAATGATG ATTCCAAACA AAAAGTCGTG CTTGAAGTGG CTTTGGAACT 100 

55 TGGTGATGGC GTTGTACGGA CCATTGCCAT GGAATCAACG GATGGATTGA 150 

CACGTGGGAT GGAAGTTCTC GATACAGGTC GTCCCATCTC TGTTCCAGTC 2 00 

GGTAAAGAAA CGCTGGGTCG TGTCTTCAAT GTGTTGGGAG ATACCATTGA 250 

CCTTGAAGAG TCTTTTCCGG CAGATTTTGA ACGTGAGCCT ATCCATAAGA 3 00 

AAGCGCCGGC TTTTGACGAA TTATCTACTT CAAGCGAAAT TTTGGAAACA 3 50 

60 GGGATTAAGG TTATCGACCT CCTAGCACCT TATCTAAAAG GTGGTAAGGT 400 
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TGGTCTCTTC GGTGGTGCTG 
TGATTCACAA TATTGCCCAA 
GTTGGCGAGC GTACCCGTGA 
ATCAGGTGTT ATTGAAAAAA 
CACCAGGAGC CCGTATGCGT 
TACTTCCGTG ATGTGGAAGG 
CTTCCGTTTC ACACAGGCTG 
TGCCATCAGC CGTTGGTTAT 
TTGCAGGAGC GTATTACCTC 



GTGTTGGTAA 
GAACACGGTG 
AGGGAACGAT 
CGGCCATGGT 
GTTGCTCTTA 
GCAGGATGTT 
GTTCAGAAGT 
CAGCCAACAC 
AACCAAGAAG 



AACCGTTCTT 
GTATCTCTGT 
CTTTACTGGG 
ATTTGGTCAG 
CTGGTTTGAC 
CTTCTGTTCA 
GTCTGCCCTC 
TTGCGACGGA 
GGTTCTGTTA 



ATCCAAGAAT 
ATTTACCGGA 
AAATGAAAGA 
ATGAATGAGC 
TATTGCGGAA 
TCGATAATAT 
TTGGGTCGTA 
GATGGGACAA 
C 



450 
500 
550 
600 
650 
700 
750 
800 
841 
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15 



20 



25 



30 



35 



40 



45 



50 
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2) INFORMATION FOR SEQ ID NO: 3 89 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus uJberis 

(B) STRAIN: ATCC 19436 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389 

GCAAACGGTG AAAAATTACC AGAGATTAAT AATGCATTGA TAGTTTATAA 50 

AGGTAGCGAT AAAAAACAAA AGATTGTTCT TGAAGTTGCT TTGGAACTTG 100 

GGGACGGAAT GGTTCGTACA ATCGCTATGG AATCAACTGA TGGGCTTACA 150 

CGTGGATTAG AAGTTTTAGA TACTGGCCGT GCCATTAGTG TACCAGTCGG 200 

AAAAGAAACT TTGGGTCGTG TTTTCAATGT GCTTGGTGAA ACCATTGATT 250 

TGGATGAACC ATTTGCCGCT GATGCTGCAA GAGAACCCAT CCATAAAAAA 300 

GCCCCAGCAT TTGATGAACT ATCAACGTCT TCAGAAATTC TTGAAACCGG 350 

AATAAAAGTT ATTGACTTAT TAGCCCCTTA TCTCAAAGGT GGTAAAGTTG 400 

GTTTATTTGG TGGTGCCGGA GTAGGTAAAA CGGTTTTAAT TCAAGAATTA 450 

ATTCATAATA TTGCACAAGA ACATGGTGGT ATTTCAGTAT TTACCGGTGT 500 

TGGTGAAAGA ACTCGTGAAG GTAATGACCT TTATTGGGAA ATGAAAGAAT 550 

CTGGCGTTAT TGAAAAAACA GCCATGGTAT TTGGACAAAT GAACGAACCA 600 

CCAGGAGCAC GTATGCGCGT TGCTTTAACA GGTTTAACCA TTGCTGAATA 650 

TTTCCGGGAT GTTGAAGGTC AAGATGTTTT GCTCTTTATT GACAACATTT 700 

TCCGTTTCAC GCAAGCTGGT TCAGAAGTTT CAGCCCTATT GGGTCGTATG 750 

CCTTCAGCGG TAGGATACCA ACCAACACTT GCTACCGAAA TGGGACAATT 800 

GCAAGAAAGA ATTACCTCAA CTAACAAGGG ATCTGTTACT TCTA 844 



2) INFORMATION FOR SEQ ID NO: 390 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 896 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tatumella ptyseos 
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(B) STRAIN: ATCC 33301 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390 

5 TTCCCTCAGG ACGCTGTACC ACAGGTGTAC AACGCTCTTG AGGTTGAAAA 50 

TGGTGATACC CGTCTGGTGC TGGAAGTTCA GCAGCAGCTG GGCGGTGGTG 100 

TCGTTCGTAC GATTGCAATG GGAACCTCTG ACGGCCTGAA ACGTGGCCTT 150 

AAGGTGACCG ATCTGCAAAA ACCGATTCAG GTACCGGTCG GTAAAGCGAC 200 

GCTGGGCCGT ATCATGAACG TACTGGGTCA GCCAATCGAT ATGAAAGGCG 250 

10 ACCTGAAGAA CGAAGATGGT AGCAATGTTG AGGTGAACTC TATTCACCGT 3 00 

GCAGCGCCAA GCTACGAAGA ACTGGCTAAC TCTACTGAGC TGCTGGAAAC 350 

GGGTATCAAG GTTATCGACC TGATCTGTCC GTTTGCAAAA GGCGGTAAAG 400 

TGGGTCTGTT CGGTGGTGCG GGTGTAGGTA AGACCGTCAA CATGATGGAA 450 

CTGATCCGTA ACATCGCTAT CGAGCACTCT GGTTACTCTG TATTTGCAGG 500 

15 GGTGGGTGAG CGTACCCGTG AAGGTAACGA CTTCTACCAC GAAATGACCG 550 

AGTCTAACGT TCTGGATAAA GTTGCTCTGG TTTATGGCCA GATGAACGAG 600 

CCACCAGGAA ACCGTCTGCG CGTTGCGCTG ACCGGTCTGA CTATGGCTGA 650 

AAAATTCCGT GACGAAGGCC GTGACGTACT GCTGTTCGTT GATAACATCT 700 

ATCGTTATAC CCTGGCCGGT ACTGAAGTTT CAGCACTGCT GGGTCGTATG 750 

20 CCTTCTGCGG TAGGTTATCA GCCAACACTG GCCGAAGAAA TGGGTGTTCT 800 

TCAGGAACGT ATCACGTCAA CCAAAACCGG TTCAATCACT TCCGTA 896 



25 2) INFORMATION FOR SEQ ID NO: 391 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trabulsiella guamensis 

(B) STRAIN: ATCC 49490 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391 



40 



TTCCCTCAGG ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTTATGAA 50 

TGGTAGTGAG CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGTGGTGGTA 100 

TCGTACGTAC TATCGCCATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG 150 

GATGTAAAAG ATCTCGAGCA TCCGATCGAA GTCCCGGTAG GTAAAGCAAC 200 

45 GCTGGGTCGT ATCATGAACG TGCTGGGTCA GCCGATCGAT ATGAAAGGCG 250 

ACATCGGCGA AGAAGAGCGT TGGGCTATCC ACCGCGCAGC ACCGTCCTAC 3 00 

GAAGAGCTGT CCAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTGAT TCGTAACATC 450 

50 GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTGG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTCTGG 550 

ACAAAGTATC CCTGGTGTAT GGACAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CACTGACCGG TCTGACCATG GCTGAGAAGT TCCGTGACGA 650 

AGGTCGTGAC GTTCTGCTGT TCGTCGATAA CATCTACCGT TACACCCTGG 700 

55 CGGGTACTGA AGTATCTGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT 750 

TACCAGCCGA CCCTGGCGGA AGAGATGGGT GTTCTTCAGG AACGTATCAC 800 

CTCAACCAAA ACCGGTTCTA TCACCTCCG 82 9 



60 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia bercovieri 

(B) STRAIN: ATCC 43970 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392 

CGAATTCCCC CAAGACGCTG TACCAAAAGT GTACAACGCC CTTGAGGTTG 50 

AAGGCACAGC TCAGAAGCTG GTGCTGGAAG TTCAGCAACA GCTGGGCGGT 100 

GGTGTTGTTC GTTGTATCGC AATGGGCTCT TCCGATGGTC TGAGCCGCGG 150 

GTTGAAAGTC ATCAACCTGG AACACCCAAT TGAAGTGCCG GTGGGTAAAT 2 00 

CAACTCTGGG CCGTATCATG AACGTATTGG GTGACCCAAT CGACATGAAA 2 50 

GGTCCTATCG GTGAAGAAGA GCGTTGGGCA ATCCACCGCG AAGCGCCTTC 3 00 

TTACGAAGAG CTTGCCAGCT CGCAAGATCT GTTAGAAACC GGTATCAAGG 3 50 

TAATGGATCT GATTTGTCCG TTCGCTAAGG GCGGTAAAGT CGGTCTGTTC 4 00 

GGTGGTGCGG GTGTGGGTAA AACAGTCAAC ATGATGGAGC TGATTCGTAA 450 

TATTGCGATT GAGCACTCAG GTTATTCTGT ATTTGCCGGT GTGGGTGAGC 500 

GTACTCGTGA GGGTAACGAC TTCTACCACG AGATGACTGA CTCCAACGTT 550 

CTGGACAAAG TATCCTTGGT TTATGGCCAG ATGAATGAGC CACCAGGTAA 600 

CCGTCTGCGC GTTGCACTGA CCGGCTTGAC CATGGCGGAG AAATTCCGTG 650 

ATGAAGGTCG TGATGTACTG TTATTCATCG ATAACATCTA TCGTTATACC 700 

CTGGCCGGTA CAGAGGTATC TGCACTGCTA GGTCGTATGC CATCAGCGGT 750 

AGGCTATCAG CCAACGCTGG CAGAAGAGAT GGGTGTGTTG CAGGAACGTA 800 

TCACTTCCAC CAAGACGGGT TCAATCACCT CCGTA 835 



2) INFORMATION FOR SEQ ID NO: 393 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia enterocolitica 

(B) STRAIN: ATCC 9610 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393 

GCTGTACCAA AAGTGTACAA CGCCCTTGAG GTTGAAGGCG CAGCTGAGAA 50 

GCTGGTGCTG GAAGTTCAGC AACAGCTGGG CGGTGGTGTT GTTCGTTGTA 100 

TCGCAATGGG CTCTTCCGAT GGTCTGAGCC GTGGGTTGAA AGTCATCAAC 150 

CTGGAACACC CAATTGAAGT GCCTGTGGGC AAGTCAACTC TGGGCCGTAT 2 00 

CATGAACGTA TTGGGTGACC CAATCGACAT GAAAGGTCCT ATCGGCGAAG 250 

AAGAGCGTTG GGCAATCCAT CGTGAAGCGC CTTCTTACGA AGATCTTGCC 3 00 

AGCTCGCAAG ACTTGTTAGA AACCGGTATC AAGGTAATGG ACTTGATTTG 350 

TCCGTTCGCT AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 

235 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



GTAAAACGGT AAACATGATG 
TCAGGTTATT CCGTATTTGC 
CGACTTCTAC CACGAGATGA 
TGGTTTATGG CCAAATGAAT 
CTGACCGGCT TGACCATGGC 
ATTGCTGTTC ATCGATAACA 
TTTCCGCACT GCTGGGTCGT 
CTGGCAGAAG AGATGGGTGT 
GGGTTCAATC AC 



GAGCTTATTC 
TGGCGTGGGT 
CTGACTCCAA 
GAGCCACCAG 
GGAGAAATTC 
TCTATCGCTA 
ATGCCATCTG 
GTTGCAGGAA 



GTAACATTGC GATTGAGCAC 
GAGCGTACTC GTGAGGGTAA 
CGTTCTGGAC AAAGTATCCT 
GTAACCGTCT GCGCGTTGCA 
CGTGATGAAG GTCGTGACGT 
TACCTTAGCC GGTACGGAAG 
CCGTAGGTTA CCAGCCAACG 
CGTATTACTT CCACCAAGAC 



450 
500 
550 
600 
650 
700 
750 
800 
812 



2) INFORMATION FOR SEQ ID NO: 3 94 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 802 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia frederiksenii 

(B) STRAIN: ATCC 33641 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 394 

AAAGTGTACA ACGCCCTTGA GGTTGAAGGT ACTGCTGAGA AGTTAGTACT 50 

GGAAGTTCAG CAACAGCTGG GCGGTGGTGT TGCTCGTTGT ATCGCCATGG 100 

GCTCTTCCGA TGGTTTGAGC CGCGGGTTGA AAGTTGTCAA CCTGGAACAC 150 

CCAATTGAAG TACCGGTTGG TAAATCAACT CTGGGCCGTA TCATGAACGT 200 

ATTGGGTGAC CCAATCGACA TGAAAGGTCC TATCGGTGAA GAAGAGCGTT 250 

GGGCAATCCA CCGCGAAGCG CCTTCTTACG AAGAGCTTGC CAGCTCGCAA 300 

GATCTGTTAG AAACCGGTAT CAAGGTAATG GATCTGATTT GCCCGTTCGC 350 

TAAAGGCGGT AAAGTCGGTC TGTTCGGTGG TGCGGGTGTA GGTAAAACGG 400 

TAAACATGAT GGAGCTGATC CGTAATATCG CGATCGAGCA CTCAGGTTAT 450 

TCCGTATTTG CGGGTGTTGG TGAACGTACC CGTGAGGGTA ACGACTTCTA 500 

CCACGAGATG ACTGACTCCA ACGTTCTGGA CAAAGTATCC TTGGTTTATG 550 

GCCAGATGAA TGAGCCACCA GGTAACCGTC TTCGCGTTGC ACTGACCGGT 600 

CTGACCATGG CGGAGAAATT CCGTGATGAA GGTCGTGACG TATTGCTGTT 650 

CATCGATAAC ATCTATCGTT ATACCTTGGC CGGTACGGAA GTATCCGCAC 700 

TGCTGGGTCG TATGCCATCT GCGGTAGGCT ATCAGCCAAC GCTGGCAGAA 750 

GAGATGGGTG TGTTGCAGGA ACGTATTACT TCCACCAAGA CGGGTTCAAT 800 

CA 802 



2) INFORMATION FOR SEQ ID NO: 395 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia intermedia 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395 



GCTGTACCAA 
GCTGGTGCTG 
TCGCAATGGG 
CTGGAACACC 
CATGAACGTA 
AAGAGCGTTG 
AGCTCACAAG 
CCCGTTCGCT 
GTAAAACAGT 
TCAGGTTATT 
CGACTTCTAC 
TGGTGTATGG 
CTGACCGGCC 
ACTGTTGTTC 
TATCCGCACT 
CTGGCAGAAG 
GGGTTC 



GAGTGTACAA 
GAAGTTCAGC 
CTCTTCCGAT 
CAATTGAAGT 
TTGGGTGACC 
GGCAATCCAC 
ATTTGTTAGA 
AAGGGCGGTA 
AAACATGATG 
CTGTATTTGC 
CACGAGATGA 
CCAGATGAAT 
TGACCATGGC 
ATCGATAACA 
GCTGGGTCGT 
AGATGGGTGT 



CGCCCTTGAG 
AACAGCTAGG 
GGTCTGAGCC 
GCCGGTTGGT 
CAATCGACAT 
CGCGAAGCGC 
AACCGGTATC 
AAGTGGGTCT 
GAGCTTATTC 
TGGTGTGGGT 
CTGACTCCAA 
GAGCCACCAG 
GGAGAAATTC 
TCTATCGCTA 
ATGCCATCAG 
GTTGCAGGAA 



GTTGAAGGCA CTGCTGAGAA 
CGGTGGTGTT GTTCGTTGTA 
GCGGGTTGAA AGTCATCAAC 
AAATCAACTC TGGGCCGTAT 
GAAAGGTCCT ATCGGTGAAG 
CTTCTTACGA AGAGCTTGCC 
AAAGTAATGG ACTTGATTTG 
GTTCGGTGGT GCGGGTGTAG 
GTAACATCGC GATTGAGCAC 
GAGCGTACTC GTGAGGGTAA 
CGTTCTGGAC AAAGTATCCT 
GTAACCGTCT GCGCGTTGCA 
CGTGATGAAG GTCGTGACGT 
TACCTTGGCC GGTACGGAAG 
CGGTAGGCTA CCAGCCAACG 
CGTATTACGT CCACCAAGAC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
806 



2) INFORMATION FOR SEQ ID NO: 3 96 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia pseudotuberculosis 

(B) STRAIN: ATCC 29833 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396 

GCTGTACCAA AAGTGTACAA CGCCCTTGAG GTAGAAGGCA CAACTGAAAA 50 

GTTAGTGCTG GAAGTTCAGC AACAGTTGGG CGGTGGTGTT GTTCGTTGTA 100 

TCGCAATGGG CTCTTCCGAT GGTCTGAGCC GTGGGTTGAA AGTAACCAAC 150 

CTGGAACACC CGATCGAAGT ACCGGTTGGT AAAGCGACCC TTGGCCGTAT 200 

CATGAACGTA TTGGGTGAAC CAATCGACAT GAAAGGTCCT ATCGGTGAAG 2 50 

AAGAGCGTTG GGCAATCCAT CGCGAAGCGC CTTCTTATGA AGAGCTTGCT 3 00 

AGCTCACAAG ATCTGTTAGA AACCGGTATC AAGGTTATGG ACCTGATTTG 350 

TCCGTTTGCT AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACAGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450 

TCTGGGTATT CTGTATTTGC CGGTGTAGGT GAGCGTACCC GTGAGGGTAA 500 

TGACTTCTAC CATGAAATGA CTGACTCCAA CGTTTTGGAC AAAGTATCCT 550 

TGGTTTACGG CCAGATGAAT GAGCCACCAG GTAACCGTCT ACGCGTTGCA 600 

CTGACCGGCC TGACCATGGC GGAGAAATTC CGTGATGAAG GTCGTGACGT 650 

ACTGCTGTTC ATCGATAATA TCTATCGTTA TACCCTAGCT GGTACGGAAG 700 

TATCCGCATT GCTGGGTCGT ATGCCATCAG CGGTAGGTTA TCAGCCAACA 750 

CTGGCTGAAG AGATGGGTGT GTTGCAGGAA CGTATTACTT CCACTAAGAC 800 

GGGTTC 806 



237 
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WO 01/23604 PCT/CA00/01150 
2) INFORMATION FOR SEQ ID NO: 397 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia rohdei 

(B) STRAIN: ATCC 43380 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 97 

TTCCCCCAAG ACGCTGTACC AAAAGTGTAC AACGCCCTTG AGGTTGAAGG 50 

TGCAGCTGAG AAGCTTGTGC TGGAAGTTCA GCAGCAGCTG GGCGGTGGTG 100 

TTGTTCGTTG TATCGCAATG GGCTCTTCCG ATGGTTTGAG CCGTGGGTTG 150 

20 AAAGTTATCA ACCTGGAACA CCCAATTGAA GTGCCAGTTG GTAAATCAAC 2 00 

TCTGGGCCGT ATCATGAACG TATTGGGTGA . CCCAATCGAC ATGAAAGGCC 250 

CTATCGGTGA AGAAGAGCGT TGGGCAATCC ACCGTGAAGC GCCTTCTTAC 3 00 

GAAGAGCTTG CCAGCTCGCA AGATCTGTTA GAAACCGGTA TCAAGGTAAT 3 50 

GGATCTGATT TGTCCGTTCG CTAAGGGCGG TAAAGTCGGT CTGTTCGGTG 4 00 

25 GTGCGGGTGT TGGTAAAACA GTAAACATGA TGGAGCTTAT TCGTAACATC 450 

GCGATTGAGC ACTCAGGTTA TTCTGTATTT GCCGGGGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAGAT GACTGACTCC AACGTTCTGG 550 

ACAAAGTATC CTTGGTTTAT GGCCAGATGA ATGAGCCACC AGGTAACCGT 600 

CTGCGCGTTG CACTGAC CGG CTTGACCATG GCGGAAAAAT TCCGTGATGA 650 

30 AGGCCGTGAC GTATTGCTGT TCATCGATAA CATTTATCGT TATACCCTAG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCATC TGCGGTAGGC 750 

TATCAGCCAA CACTGGCAGA AGAGATGGGT GTGTTGCAGG AACGTATTAC 800 

TTCCACTAAG ACGGGTTCAA TCACCTCCG 829 



35 



50 



2) INFORMATION FOR SEQ ID NO: 3 98 



(i) SEQUENCE CHARACTERISTICS : 
40 (A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yokenella regensburgei 

(B) STRAIN: ATCC 35313 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398 



ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTACAAAA TGGTAACGAG 50 

AAACTGGTGC TGGAAGCTCA GCAGCAGCTC GGCGGCGGTA TCGTGCGTAC 100 

55 TATCGCCATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG GAAGTTAAAG 150 

ACCTCGAGCA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC CCTGGGCCGT 200 

ATCATGAACG TCCTGGGTCA GCCGATCGAC ATGAAAGGCG ACATCGGTGA 250 

AGAAGAGCGT TGGGCTATCC ACCGCGCAGC ACCTTCCTAT GAAGAGCTGT 3 00 

CCAGCTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTAAT GGATCTGATC 350 

60 TGCCCGTTCG CTAAGGGTGG TAAAGTCGGT CTGTTCGGTG GTGCGGGTGT 400 

238 
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AGGTAAAACT GTAAACATGA TGGAGCTTAT 
ACTCCGGTTA CTCTGTGTTT GCAGGCGTGG 
AACGACTTCT ACCACGAAAT GACCGACTCC 
CCTGGTGTAC GGCCAGATGA ACGAGCCGCC 
5 CGCTGACCGG CCTGACCATG GCTGAGAAAT 
GTTCTGCTGT TCGTCGATAA CATCTACCGT 
AGTATCCGCA CTGCTGGGTC GTATGCCTTC 
CTCTGGCGGA AGAGATGGGT GTTCTTCAGG 
ACCGGTTCTA TCACCTCCG 



PCT/CA00/01150 

CCGTAACATC GCGATTGAGC ' 4^0' 
GTGAACGTAC TCGTGAGGGT 500 
AACGTACTGG ATAAAGTATC 550 
GGGAAACCGT CTGCGCGTTG 600 
TCCGTGACGA AGGCCGTGAC 650 
TATACCCTGG CCGGTACGGA 700 
TGCGGTAGGT TATCAGCCAA 750 
AACGTATCAC CTCTACCAAA 800 

819 



2) INFORMATION FOR SEQ ID NO: 399 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1097 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yarrowia lipolytics 
25 (B) STRAIN: ATCC 38295 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399 

AAGCTTAAGG CTGAGCGAGA GCGAGGTATC ACCATTGATA TCGCTCTCTG 50 

30 GAAGTTCCAG ACCCCTAAGT ACTACGTCAC CGTTATTGAT GCTCCCGGTC 100 

ACCGAGATTT CATCAAGAAC ATGATTACCG GTACTTCCCA GGCTGACTGC 150 

GCCATCCTCA TCATTGCTGG TGGTGTTGGT GAGTTCGAGG CTGGTATCTC 200 

CAAGGATGGT CAGACCCGAG AGCACGCCCT GCTCGCTTTC ACCCTCGGTG 250 

TTAAGCAGCT CATTGTTGCT ATCAACAAGA TGGACTCCGT CAAGTGGTCT 3 00 

35 CAGGATCGAT ACCTCGAGAT TTGCAAGGAG ACTGCCAACT TCGTCAAGAA 350 

GGTCGGTTAC AACCCCAAGG CTGTCCCCTT CGTCCCCATT TCCGGATGGA 400 

ACGGTGACAA CATGATCGAG CCCTCTACCA ACTGTGACTG GTACAAGGGA 450 

TGGACCAAGG AGACCAAGGC CGGCGAGATC AAGGGTAAGA CCCTCCTCGA 500 

GGCCATTGAT GCCATTGAGC CCCCCGTGCG ACCCCACGAC AAGCCCCTCC 550 

40 GACTTCCCCT CCAGGATGTC TACAAGATCG GTGGTATCGG CACAGTGCCC 600 

GTTGGCCGAG TCGAGACCGG TGTTATCAAG GCCGGTATGG KTGTTACCTT 650 

CGCTCCCGCC AACGTGACCA CTGAGGTCAA GTCTGTCGAG ATGCACCACG 700 

AGATCCTCCC CGATGGAGGT TTCCCCGGTG ACAACGTCGG TTTCAACGTC 750 

AAGAACGTTT CCGTCAAGGA TATCCGACGA GGTAACGTTG CTGGTGACTC 800 

4 5 CAAGAACGAC CCCCCCAAGG GCTGCGACTC TTTCAACGCT CAGGTCATTG 850 

TTCTTAACCA CCCCGGTCAG ATCGGTGCTG GTTACGCTCC CGTCCTTGAT 900 

TGCCACACTG CCCACATTGC TTGCAAGTTC GACACCCTGA TCGAGAAGAT 950 

CGACCGACGA ACCGGTAAGA AGATGGAGGA CTCCCCCAAG TTCATCAAGT 1000 

CTGGTGATGC TGCCATTGTC AAGATGGTTC CCTCCAAGCC CATGTGTGTC ~ 1050 

50, GAGGCCTTCA CTGAGTACCC CCCTCTTGGT CGATTCGCCG TCCGAGA 1097 



2 ) INFORMATION FOR SEQ ID NO: 400 

55 

( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1233 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 

239 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 
5 (A) ORGANISM: Absidia corymbifera 

(B) STRAIN: ATCC 46775 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 00 

10 CAAGCTTAAG GCTGAACGTG AGCGTGGTAT CACCATCGAT ATCGCTCTCT 50 

GGAAGTTCGA GACTCCCAAG TACCACGTTA CCGTCATTGA TGCCCCTGGC 100 

CATCGTGATT TCATCAAGAA CATGATTACT GGTACTTCCC AAGCTGACTG 150 

CGGTATCTTG ATTATTGCTG CTGGTACTGG TGAATTCGAA GCTGGTATCT 200 

CCAAGGATGG TCAAACCCGT GAACACGCTT TGCTTGCTTT CACCCTTGGT 250 

15 GTCCGTCAAT TGATTGTCGC TATCAACAAG ATGGATTCCA CCAAGTACTC 300 

TGAGGCCCGT TACAACGAAA TTGTCAAGGA AGTCTCCACC TTCATCAAGA 350 

AGATTGGTTT CAACCCCAAG TCCGTTCCTT TCGTCCCTAT CTCTGGCTGG 400 

AACGGTGACA ACATGTTGGA GGARTCCACC AACATGCCTT GGTTCAAGGG 450 

ATGGAACAAG GAGACTAAGG CTGGTGCCAA GACYGGCAAG ACCCTTCTTG 500 

20 AAGCCATTGA CAACATTGAT CCCCCTGTTC GTCCTTCCGA CAAGCCCCTT 550 

CGTCTTCCCC TTCAAGATGT CTACAAGATC GGTGGTATTG GTACAGTTCC 600 

TGTCGGTCGT GTTGAGACTG GTGTCATCAA GCCTGGTATG GTTGTCACCT 650 

TCGCTCCCGC TAACGTCACC ACTGAAGTCA AGTCCGTYGA AATGCACCAC 700 

GAGCAACTTG CTGAAGGTGT TCCCGGTGAC AACGTCGGTT TCAACGTCAA 750 

25 GAACGTTTCC GTCAAGGATA TCCGCCGTGG TAACGTYTGC TCTGACTCCA 8 00 

AGAACGACCC CGCCAAGGAA TCCGCTTCCT TCACCGCTCA AGTTATTGTC 850 

TTGAACCACC CTGGTCARAT TGGTGCTGGT TACTCTCCTG TCTTGGATTG 900 

CCACACTGCT CACATTGCAT GCAAGTTCTY TKAGCTTCTT KAGAAGATCG 950 

ATYGTCGTTC CGGTAAGTAA ATANTTTGGT TTRGGATATG GGTATTGGGC 1000 

30 TTAATCTYTG GATTTTGCCT CAATTGCTCC TTCCTTGATC TTTCTCGATT 1050 

ACTTTTTGAT CATTTGCTAA TCCAAACCCT TTCCATTTYA TTGAAAACAG 1100 

GTAAGAAGTT GGAAGACTCC CCCAAGTTCG TCAAGWSYGG TGACTCTGCY 1150 

ATCGTCAAGA TGGTTCCTTC CAAGCCCATG TGCGTTGAAG CCTACACTGA 12 00 

ATATCCTCCT CTTGGTCGTT TCGCTGTCCG TGA 1233 

35 



2) INFORMATION FOR SEQ ID NO: 4 01 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1151 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alternaria alternata 
50 (B) STRAIN: ATCC 62099 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401 

CAAGTTGAAG GCCGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCTCTCT 50 

55 GGAAGTTCGA GACTCCCAAG GTTAGTACCC CTCTGCCTAC TACATCAAGT 100 

TCTTTACAAT GCTAACATGT TGTACTCAGT ACTATGTCAC CGTCATTGAC 150 

GCCCCCGGTC ACCGTGATTT CATCAAGAAC ATGATCACTG GTACCTCCCA 200 

GGCCGACTGC GCTATTCTCA TCATTGCCGC CGGTACTGGT GAGTTCGAGG 250 

CTGGTATCTC CAAGGATGGC CAGACTCGTG AGCACGCTCT CCTCGCTTAC 300 

60 ACCCTCGGTG TCAAGCAGCT CATCGTTGCC ATCAACAAGA TGGACACCAC 350 

240 
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CAAGTGGTCC GAGGAGCGTT ACCAGGAGAT 
TCATCAAGAA GGTCGGCTAC AACCCCAAGC 
TCCGGTTTCA ACGGTGACAA CATGATTGAG 
GTACAAGGGT TGGGAGAAGG AGACCAAGGC 
5 TCCTCGAGGC CATCGACGCC ATCGACCCTY 
CCCCTCCGTC TTCCCCTYCA GGATGTTTAC 
GGTGCCCGTC GGTCGTGTCG AGACCGGTAT 
TCACCTTCGC CCCCGCTGGT GTCACCACTG 
CACCACGAGC AGCTCACCGA GGGTGTCCCC 

10 CGTCAAGAAC GTCTCCGTCA AGGAGATCCG 
ACTCCAAGAA CGACCCCCCC AAGGGTGCCG 
ATCGTCCTCA ACCACCCTGG TCAGGTCGGT 
CGACTGCCAC ACCGCCCACA TTGCTTGCAA 
AGATTGACCG CCGTACCGGA AAGTCTGTTG 

15 AAGTCCGGTG ACGCCGCCAT CGTCAAGATG 
CGTTGAGGCT TTCACTGACT ACCCTCCTCT 
A 



PCT/CA00/01150 

CATCAAGGAG ACCTCCAACT 4~00 

ACGTTCCCTT CGTCCCCATC 450 

GCCTCATCCA ACTGCCCCTG 500 

CAAGGCCACT GGTAAGACCC 550 

CCAGCCGTCC CACCGACAAG 600 

AAGATTGGTG GTATTGGCAC 650 

CATCAAGGCC GGTATGGTCG 700 

AAGTCAAGTC CGTCGAGATG 750 

GGTGACAACG TCGGCTTCAA 800 

TCGTGGTAAC GTTGCCGGTG 850 

AGTCCTTCAA CGCCCAGGTC 900 

GCTGGTTACG CCCCAGTCCT 950 

GTTCTCTGAG CTCCTCGAGA 1000 

AGAACTCTCC CAAGTTCATC 1050 

GTTCCCTCCA AGCCCATGTG 1100 

CGGTCGTTTC GCTGTCCGTG 1150 

1151 



20 



2) INFORMATION FOR SEQ ID NO: 4 02 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1283 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus flaws 

(B) STRAIN: ATCC 26947 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402 

CAAGCTCAAG TCCGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCCCTCT 50 

GGAAGTTCCA GACCTCCAAG TATGAGGTCA CCGTCATTGG TAAGCATTTG 100 

AGTTCCAACC TACGTTGCCC AACATTTACA GTCATCTAAC AAAGTTCAAT 150 

40 AGATGCCCCC GGTCACCGTG ACTTCATCAA GAACATGATC ACTGGTACTT 2 00 

CCCAGGCTGA CTGCGCTATC CTCATCATTG CCTCCGGTAC TGGTGAATTC 250 

GAGGCTGGTA TCTCCAAGGA TGGTCAGACC CGTGAGCACG CTCTGCTCGC 300 

TTTCACCCTC GGTGTCCGTC AGCTCATCGT TGCCCTCAAC AAGATGGACA 350 

CCTGCAAGTG GTCTCAGGAT CGTTACAACG AAATCGTTAA GGAGACTTCC 400 

45 AACTTCATCA AGAAGGTCGG ATACAACCCC AAGAGCGTTC CTTTCGTCCC 450 

CATCTCCGGT TTCAACGGTG ACAACATGAT TGAGGCCTCC ACCAACTGCC 500 

CCTGGTACAA GGGTTGGGAG AAGGAGACCA AGGCTGGCAA GTCCACCGGT 550 

AAGACCCTTC TCGAGGCCAT CGATGCCATC GAGCCCCCCG TCCGTCCCAC 600 

CGACAAGCCT CTCCGTCTTC CCCTYCAGGA TGTCTACAAG ATCTCTGGTA 650 

50 TCGGTACTGT GCCCGTCGGT CGTGTCGAGA CTGGTGTCAT CAAGCCTGGT 700 

ATGGTCGTTA CTTTCGCTCC TGCCAACGTG ACCACTGAAG TCAAGTCCGT 750 

TGAAATGCAC CACCAGCAGC TCCAGGCCGG TAACCCCGGT GACAACGTTG 800 

GTTTCAACGT CAAGAACGTC TCCGTCAAGG AAGTCCGCCG TGGTAACGTT 850 

GCCGGTGACT CCAAGAACGA CCCCCCTGCT GGCTGCGATT CCTTCAACGC 900 

55 CCAGGTCATC GTCCTTAACC ACCCCGGTCA GGTCGGCAAC GGTTACGCTC 950 

CCGTCCTGGA CTGCCACACC GCTCACATTG CTTGCAAGTT CGCTGAGCTC 1000 

CTTGAGAAGA TTGACCGCCG TACCGGTAAA TCTGTTGAGG ACAAGCCCAA 1050 

GTTCATCAAG TCTGGTGATG CTGCCATCGT CAAGATGATT CCCTCCAAGC 1100 

CCATGTGTGT GGAGTCTTTC ACTGACTTCC CCCCTCTTGG TCGTTTCGCT 1150 

60 GTCCGTGACG TAAGTTTTTC CCTCTTGACT ATCTTCACAA TTTTTCACAT 12 00 

241 
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ATTTTCACGC CTCGTCCCAC TCTTTTTCCT CCCTTCCTCT TTGGTTCCCC 1250 
TTTTTGCCTG CAAGTTCTCT ATAGCTAACA TGA 12 83 



2) INFORMATION FOR SEQ ID NO: 403 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1103 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: DAL95 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 403 

TCCGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCCCTCT GGAAGTTCCA 50 

GACTCCCAAG TATGAGGTCA CTGTCATCGG TAAGCTCGAC TCGCCCCGAT 100 

ATGTTTTGGT GCTGTAGCTA ACACGATCTG AAGATGCCCC CGGTCACCGT 150 

GACTTCATCA AGAACATGAT CACTGGTACC TCCCAGGCTG ACTGCGCTAT 200 

CCTCATCATT GCCTCCGGTA CTGGTGAGTT CGAGGCTGGT ATCTCCAAGG 250 

ATGGCCAGAC CCGTGAGCAC GCTCTGCTGG CTTTCACCCT CGGTGTCAAG 300 

CAGCTCATCG TCGCCCTCAA CAAGATGGAC ACCTGCAAGT GGTCCGAGGA 350 

TCGTTACAAC GAAATTGTCA AGGAAACCTC CAACTTCATC AAGAAGGTCG 400 

GCTACAACCC CAAGGCCGTT CCCTTCGTCC CCATCTCTGG CTTCAACGGT 450 

GACAACATGC TTGAGCCCTC CTCCAACTGC CCCTGGTACA AGGGATGGGA 500 

GAAGGAGACC AAGGCCGGCA AGGTCACTGG TAAGACCCTC ATCGAGGCCA 550 

TCGACGCCAT TGAGCCCCCT GTCCGTCCCT CCAACAAGCC CCTCCGTCTT 600 

CCCCTCCAGG ATGTCTACAA GATCTCTGGT ATCGGAACGG TCCCTGTCGG 650 

CCGTGTCGAG ACCGGTATCA TCAAGCCCGG CATGGTCGTC ACCTTCGCCC 700 

CCGCCAACGT CACCACTGAA GTCAAGTCCG TCGAGATGCA CCACCAGCAG 750 

CTCCAGGAGG GTGTCCCCGG TGACAACGTC GGTTTCAACG TCAAGAACGT 800 

TTCCGTCAAG GAAGTCCGCC GTGGTAACGT CTGCGGTGAC TCCAAGAACG 850 

ATCCCCCTCA GGGTGCTGCC TCCTTCAACG CCCAGGTCAT CGTCCTCAAC 900 

CACCCCGGTC AGGTCGGCGC TGGTTACGCC CCCGTCCTCG ACTGCCACAC 950 

TGCCCACATT GCTTGCAAGT TCTCTGAGCT GCTTGAGAAG ATTGACCGCC 1000 

GTACCGGCAA GTCTGTTGAG AACAACCCCA AGTTCATCAA GTCCGGTGAT 1050 

GCCGCCATCG TGAAGATGGT TCCTTCCAAG CCCATGTGTG TCGAGTCCTT 1100 

CAC 1103 



2) INFORMATION FOR SEQ ID NO: 404 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1149 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: WSA-172 

242 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404 

AAGCTCAAGT CCGAGCGTGA GCGTGGTATC ACCATCGACA TTGCCCTCTG 50 

5 GAAGTTCCAG ACTCCCAAGT ATGAGGTCAC TGTCATCGGT AAGCTCGACT 100 

CGCCCCGATA TGTTTTGGTG CTGTAGCTAA CACGATCTGA AGATGCCCCC 150 

GGTCACCGTG ACTTCATCAA GAACATGATC ACTGGTACCT CCCAGGCTGA 2 00 

CTGCGCTATC CTCATCATTG CCTCCGGTAC TGGTGAGTTC GAGGCTGGTA 2 50 

TCTCCAAGGA TGGCCAGACC CGTGAGCACG CTCTGCTGGC TTTCACCCTC 3 00 

10 GGTGTCAAGC AGCTCATCGT CGCCCTCAAC AAGATGGACA CCTGCAAGTG 3 50 

GTCCGAGGAT CGTTACAACG AAATTGTCAA GGAAACCTCC AACTTCATCA 400 

AGAAGGTCGG CTACAACCCC AAGGCCGTTC CCTTCGTCCC CATCTCTGGC 450 

TTCAACGGTG ACAACATGCT TGAGCCCTCC TCCAACTGCC CCTGGTACAA 500 

GGGATGGGAG AAGGAGACCA AGGCCGGCAA GGTCACTGGT AAGACCCTCA 550 

15 TCGAGGCCAT CGACGCCATT GAGCCCCCTG TCCGTCCCTC CAACAAGCCC 600 

CTCCGTCTTC CCCTCCAGGA TGTCTACAAG ATCTCTGGTA TCGGAACGGT 650 

CCCTGTCGGC CGTGTCGAGA CCGGTATCAT CAAGCCCGGC ATGGTCGTCA 700 

CCTTCGCCCC CGCCAACGTC ACCACTGAAG TCAAGTCCGT CGAGATGCAC 750 

CACCAGCAGC TCCAGGAGGG TGTCCCCGGT GACAACGTCG GTTTCAACGT 800 

20 CAAGAACGTT TCCGTCAAGG AAGTCCGCCG TGGTAACGTC TGCGGTGACT 850 

CCAAGAACGA TCCCCCTCAG GGTGCTGCCT CCTTCAACGC CCAGGTCATC 900 

GTCCTCAACC ACCCCGGTCA GGTCGGCGCT GGTTACGCCC CCGTCCTCGA 950 

CTGCCACACT GCCCACATTG CTTGCAAGTT CTCTGAGCTG CTTGAGAAGA 1000 

TTGACCGCCG TACCGGCAAG TCTGTTGAGA ACAACCCCAA GTTCATCAAG 1050 

25 TCCGGTGATG CCGCCATCGT GAAGATGGTT CCTTCCAAGC CCATGTGTGT 1100 

CGAGTCCTTC ACTGACTACC CCCCTCTGGG TCGTTTCGCC GTCCGTGAC 1149 



30 2) INFORMATION FOR SEQ ID NO: 405 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1151 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus niger 

(B) STRAIN: ATCC 9508 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405 



CAAGCTCAAG TCCGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCCCTCT 50 

GGAAGTTCCA GACTGGCAAG TATGAGGTCA CCGTCATTGG TATGTACTCA 100 

CAGAGTTCTC TTTTCATCAA AGCAATATAC TAACGTCCAT CATAGACGCC 150 

CCCGGTCACC GTGACTTCAT CAAGAACATG ATCACTGGTA CCTCCCAGGC 200 

50 TGACTGCGCT ATCCTCATCA TTGCCTCCGG TACTGGTGAG TTCGAGGCTG 250 

GTATCTCCAA GGATGGCCAG ACTCGTGAGC ACGCTCTGCT TGCTTTCACC 3 00 

CTCGGTGTCC GCCAGCTCAT CGTTGCCCTC AACAAGATGG ACACCTGCAA 3 50 

GTGGTCCGAG GACCGTTACA ACGAAATCGT TAAGGAGACC TCCAACTTCA 4 00 

TCAAGAAGGT CGGATACAAC CCCAAGGGTG TTCCTTTCGT CCCCATCTCC 4 50 

55 GGTTTCAACG GTGACAACAT GCTCGAGCCC TCCCCCAACT GCCCCTGGTA 500 

CAAGGGTTGG GAGAAGGAGA CCAAGGCCGG CAAGGTCACC GGTAAGACCC 550 

TCCTTGAGGC CATCGACGCC ATCGAGCCCC CCGTCCGTCC CTCCAACAAG 600 

CCCCTCCGTC TTCCCCTCCA GGATGTCTAC AAGATCTCCG GTATTGGAAC 650 

TGTTCCCGTC GGTCGTGTCG AGACCGGTAT CATTGCCCCT GGTATGGTCG 700 

60 TGACCTTCGC TCCCGCCAAC GTCACCACTG AAGTCAAGTC CGTTGAGATG 750 

243 
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CACCACCAGC AGCTCAAGGA AGGTGTCCCC 
CGTCAAGAAC GTTTCCGTCA AGGAGGTTCG 
ACTCCAAGAA CGACCCCCCT CTTGGCTGTG 
ATCGTCCTCA ACCACCCCGG TCAGGTCGGC 
5 GGACTGCCAC ACTGCTCACA TTGCTTGCAA 
AGATTGACCG CCGTACCGGA AAGTCTGTTG 
AAGTCCGGTG ACGCTGCCAT CGTCAAGATG 
TGTTGAGGCT TTCACTGACT ACCCCCCTCT 
A 
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GGTGACAACG TTGGTTTCAA " 800 

CCGTGGTAAC GTTGCCGGTG 850 

AGAGCTTCAC CGCCCAGGTC 900 

GCTGGTTACG CTCCCGTCCT 950 

GTTCGCTGAG CTCCTTGAGA 1000 

AATCTTCCCC CAAGTTCATC 1050 

ATTCCCTCCA AGCCCATGTG 1100 

TGGTCGTTTC GCCGTCCGCG 1150 

1151 



2) INFORMATION FOR SEQ ID NO: 406 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1093 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastoschizomyces capitatus 
25 (B) STRAIN: ATCC 10663 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 06 

GCTTAAAGCT GAACGTGAAC GTGGTATCAC CATTGATATC GCTCTCTGGA 50 

30 AGTTCGAAAC TCCTAAGTAC TACGTTACTG TTATTGATGC TCCAGGTCAC 100 

CGTGATTTCA TCAAGAACAT GATTACTGGT ACTTCCCAAG CCGATTGCGC 150 

CATTCTTATC ATTGCTGCCG GTGTCGGTGA ATTCGAAGCT GGTATCTCCA 200 

AGGAAGGTCA AACCAGAGAA CACGCTCTTC TCGCTTTCAC CCTTGGTGTC 250 

AGACAACTTA TCATTGCCAT CAACAAGATG GACTCTGTCA AGTGGGACCA 300 

35 AAAGAGATAC GAAGAAATCG TCAAGGAGGC TTCCAACTTC GTCAAGAAGG 350 

TTGGTTACAA CCCCAAGTCT GTTCCATTCG TTCCTATCTC TGGTTGGAAC 400 

GGTGACAACA TGTTGGAACC TACCACCAAC GCCCCATGGT ACAAGGGATG 450 

GACCAAGGAA ACCAAGGCTG GTGCCACTAA GGGTATGACT CTTATTGAAG 500 

CCATTGACGC CATTGAACCA CCAGTAAGAC CATCCGACAA GCCACTCCGT 550 

40 CTCCCACTCC AAGATGTTTA CAAGATTGGT GGTATCGGAA CTGTGCCAGT 600 

CGGCCGTGTC GAAACCGGTA TCATCAAGGC CGGTATGGTC GTTACCTTTG 650 

CTCCACCAAT GGTCACAACT GAAGTTAAGT CCGTTGAAAT GCACCACGAA 700 

CAACTTGCTC AAGGTAACCC AGGTGACAAC GTTGGTTTCA ACGTCAAGAA 750 

CGTTTCCGTT AAGGAAATCA GACGTGGTAA CGTCTGTGGT GACTCCAAGA 800 

45 ACGATCCACC AAAGGGCTGC GAATCTTTCA ACGCTCAAGT TATCGTCTTG 850 

AACCACCCTG GTCAAATCTC TGCTGGTTAC TCTCCAGTTC TCGATTGCCA 900 

CACTGCCCAC ATTGCCTGCA GATTCGACGA ACTCCTTGAA AAGATCGACC 950 

GTCGTTCTGG TAAGAAGATT GAAGACTCTC CAAAGTTTGT CAAGTCTGGT 1000 

GATGCCGCTA TCGTCAAGAT GATCCCAACC AAGCCAATGT GCGTTGAAAC 1050 

50 CTTCACTGAA TACCCACCAC TTGGTCGTTT CGCCGTCCGT GAT 1093 



2) INFORMATION FOR SEQ ID NO: 407 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 

244 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

(ii) MOLECULE TYPE: Genomic DNA 

(Vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 10231 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407 

10 CTTGGACAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAATATG ATCACTGGTA CTTCTCAAGC 150 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 2 00 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 250 

15 TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 3 00 

ATGGGACAAA AACAGATTTG AAGAAATCAT CAAGGAAACC TCCAACTTCG 350 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTYGT TCCAATCTCT 400 

GGTTGGAATG GTGACAACAT GATTGAACCA TCCACCAACT GTCCATGGTA 450 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAAGTTACT GGTAAGACCT 500 

20 TGTTAGAAGC TATTGACGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550 

CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACTGGTAT CATCAAAGCC GGTATGGTTG 650 

TTACTTTCGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700 

CATCACGAAC AATTGGCTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

25 CGTTAAGAAC GTTTCCGTTA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

ATTGTTTTGA ACCATCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTCTT 900 

GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950 

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 1000 

30 AAATCCGGTG ATGCTGCTAT CGTCAAGATG GTCCCAACCA AACCAATGTG 1050 

TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 1100 

A 1101 



35 



45 



2) INFORMATION FOR SEQ ID NO: 408 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1089 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 18804 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408 

GAAGGCTGAA AGAGAAAGAG GTATCACCAT TGATATCGCT TTGTGGAAAT 50 

TCGAAACTCC AAAATACCAC GTTACCGTCA TTGATGCTCC AGGTCACAGA 100 

GATTTCATCA AGAATATGAT CACTGGTACT TCTCAAGCTG ATTGTGCTAT 150 

55 TTTGATTATT GCTGGTGGTA CTGGTGAATT CGAAGCCGGT ATTTCTAAGG 200 

ATGGTCAAAC CAGAGAACAC GCTTTGTTGG CTTACACTTT GGGTGTCAAA 250 

CAATTGATTG TTGCTGTCAA CAAGATGGAC TCTGTCAAAT GGGACAAAAA 300 

CAGATTTGAA GAAATCATCA AGGAAACCTC CAACTTCGTC AAGAAGGTTG 350 

GTTACAACCC AAAGACTGTT CCATTCGTTC CAATCTCTGG TTGGAATGGT 400 

60 GACAACATGA TTGAACCATC CACCAACTGT CCATGGTACA AGGGTTGGGA 450 

245 
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AAAGGAAACC 
TTGACGCTAT 
CCATTGCAAG 
TAGAGTTGAA 
CAGCTGGTGT 
TTGGCTGAAG 
TTCCGTTAAA 
ATCCACCAAA 
CATCCAGGTC 
TGCCCACATT 
GAACTGGTAA 
GCTGCTATCG 
CACTGACTAC 



AAATCCGGTA 
TGAACCACCA 
ATGTTTACAA 
ACTGGTATCA 
TACCACTGAA 
GTGTTCCAGG 
GAAATTAGAA 
GGGTTGTGAC 
AAATCTCTGC 
GCTTGTAAAT 
GAAATTGGAA 
TCAAGATGGT 
CCACCATTAG 



AAGTTACTGG 
ACCAGACCAA 
GATCGGTGGT 
TCAAAGCCGG 
GTCAAATCCG 
TGACAATGTT 
GAGGTAACGT 
TCTTTCAATG 
TGGTTACTCT 
TCGACACTTT 
GAAAATCCAA 
CCCAACCAAA 
GTAGATTCGC 



TAAGACCTTG TTAGAAGCTA 
CCGACAAACC ATTGAGATTG 
ATTGGTACTG TGCCAGTCGG 
TATGGTTGTT ACTTTCGCCC 
TTGAAATGCA TCACGAACAA 
GGTTTCAACG TTAAGAACGT 
TTGTGGTGAC TCCAAGAACG 
CCCAAGTCAT TGTTTTGAAC 
CCAGTCTTGG ATTGTCACAC 
GGTTGAAAAG ATTGACAGAA 
AATTCGTCAA ATCCGGTGAT 
CCAATGTGTG TTGAAGCTTT 
TGTCAGAGA 



500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1089 



2) INFORMATION FOR SEQ ID NO: 409 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 56884 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409 

CTTGGACAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAATATG ATCACTGGTA CTTCTCAAGC 150 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 2 00 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 2 50 

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 300 

ATGGGACAAA AACAGATTTG AAGAAATCAT CAAGGAAACC TCCAACTTCG 350 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 4 00 

GGTTGGAATG GTGACAACAT GATTGAACCA TCCACCAACT GTCCATGGTA 450 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAAGTTACT GGTAAGACCT 500 

TGTTAGAAGC TATTGACGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550 

CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACTGGTAT CATCAAAGCC GGTATGGTTG 650 

TTACTTTCGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700 

CATCACGAAC AATTGGCTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

CGTTAAGAAC GTTTCCGTTA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

ATTGTTTTGA ACCATCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTCTT 900 

GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950 

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 10 00 

AAATCCGGTG ATGCTGCTAT CGTCAAGATG GTCCCAACCA AACCAATGTG 1050 

TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 1100 

A 1101 



2) INFORMATION FOR SEQ ID NO: 410 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 60193 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410 

15 CTTGGACAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAATATG ATCACTGGTA CTTCTCAAGC 150 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 2 00 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 250 

2 0 TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 3 00 

ATGGGACAAA AACAGATTTG AAGAAATCAT CAAGGAAACC TCCAACTTCG 350 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 400 

GGTTGGAATG GTGACAACAT GATTGAACCA TCCACCAACT GTCCATGGTA 4 50 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAAGTTACT GGTAAGACCT 500 

25 TGTTAGAAGC TATTGACGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550 

CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACTGGTAT CATCAAAGCC GGTATGGTTG 650 

TTACTTTCGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700 

CATCACGAAC AATTGGCTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

30 CGTTAAGAAC GTTTCCGTTA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

ATTGTTTTGA ACCATCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTCTT 900 

GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950 

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 1000 

35 AAATCCGGTG ATGCTGCTAT CGTCAAGATG GTCCCAACCA AACCAATGTG 1050 

TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 1100 

AT 1102 



4.0 



50 



2 ) INFORMATION FOR SEQ ID NO: 411 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1102 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 90028 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411 

CTTGGACAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAATATG ATCACTGGTA CTTCTCAAGC 150 

60 TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 200 

247 
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GTATTTCTAA 
TTGGGTGTCA 
ATGGGACAAA 
TCAAGAAGGT 
5 GGTTGGAATG 
CAAGGGTTGG 
TGTTAGAAGC 
CCATTGAGAT 
TGTGCCAGTC 

10 TTACTTTCGC 
CATCACGAAC 
CGTTAAGAAC 
ACTCCAAGAA 
ATTGTTTTGA 

15 GGATTGTCAC 
AGATTGACAG 
AAATCCGGTG 
TGTTGAAGCT 
AT 



2) INFORMATION FOR SEQ ID NO: 412 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dubliniensis 

(B) STRAIN: NCPF 3108 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412 

CTTGGATAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

4 0. CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAACATG ATCACTGGTA CTTCTCAAGC 150 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 200 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 2 50 

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 300 

45 ATGGGACAAA AACAGATTTG AAGAAATCAT CAAGGAAACC TCTAACTTCG 350 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 400 

GGTTGGAATG GTGACAACAT GATTGAAGCT TCCACCAACT GTCCATGGTA 450 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAGGTTACT GGTAAGACCT 500 

TGTTAGAAGC TATTGATGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550 

50 CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACTGGTGT CATTAAAGCC GGTATGGTTG 650 

TCACTTTTGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700 

CATCACGAAC AATTGGCTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

CGTTAAGAAT GTTTCTGTCA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

55 ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

ATTGTCTTGA ACCATCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTCTT 900 

GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950 

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 1000 

AAATCCGGTG ACGCTGCTAT YGTCAAGATG GTCCCAACCA AACCAATGTG 1050 

60 TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 1100 

248 
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GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 25(f 
AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 3 00 

AACAGATTTG AAGAAATCAT CAAGGAAACC TCCAACTTCG 350 

TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 4 00 

GTGACAACAT GATTGAACCA TCCACCAACT GTCCATGGTA 4 50 
GAAAAGGAAA CCAAATCCGG TAAAGTTACT GGTAAGACCT 500 
TATTGACGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550 
TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600 
GGTAGAGTTG AAACTGGTAT CATCAAAGCC GGTATGGTTG 650 
CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700 
AATTGGCTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 
GTTTCCGTTA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 
CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 
ACCATCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTCTT 900 
ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950 

AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 1000 

ATGCTGCTAT CGTCAAGATG GTCCCAACCA AACCAATGTG 1050 

TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 1100 

1102 



25 



30 



35 



WO 01/23604 

A 



PCT/CA00/01150 
1101 



5 2) INFORMATION FOR SEQ ID NO: 413 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida catenulata 

(B) STRAIN: ATCC 10565 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 413 



GGACAAGCTT AAGGCTGAGC GTGAGAGAGG TATCACCATT GACATTGCCT 50 

TGTGGAAGTT CGAGACTCCC AAGTACCACG TCACTGTCAT TGACGCCCCC 100 

GGTCACAGAG ATTTCATCAA GAACATGATC AC TGGTACC T CGCAGGCTGA 150 

CTGTGCTATC TTGATCATTG CTTCCGGTGT CGGTGAGTTC GAGGCTGGTA 2 00 

25 TCTCCAAGGA CGGTCAGACC CGTGAGCACG CCTTGTTGGC CTACACCTTG 250 

GGTGTCAAGC AGTTGATCGT TGCCATCAAC AAGATGGACT CCGTCAAGTG 3 00 

GGACAAGAAC AGATTCGAGG AGATTGTCAA GGAGACCACC AACTTCGTCA 350 

AGAAGGTTGG TTACAACCCC AAGGCTGTCC CCTTCGTCCC CATCTCTGGC 400 

TGGAACGGTG ACAACATGAT TGAGGCCTCC ACCAACTGCC CCTGGTACAA 450 

30 GGGCTGGGAG AAGGAGACCA AGGCCGGTAA GTCTACCGGT AAGACCTTGT 500 

TGGAGGCCAT TGACGCCATT GAGCCCCCTA CCAGACCCAC CGACAAGCCC 550 

TTGAGATTGC CCTTGCAGGA TGTCTACAAG ATCGGTGGTA TTGGTACGGT 600 

GCCCGTCGGC CGTGTCGAGA CCGGTGTCAT CAAGCCCGGT ATGGTCGTCA 650 

CCTTCGCCCC CGCTGGTGTC ACCACTGAAG TCAAGTCCGT CGAGATGCAC 700 

35 CACGAGCAGT TGTCCGAGGG TGTCCCCGGT GACAACGTTG GTTTCAACGT 750 

CAAGAACGTC TCTGTTAAGG AGATCAGACG TGGTAACGTC TGCGGTGACT 800 

CCAAGAACGA CCCCCCCATG GGTTGCTCTT CTTTCAACGC CCAGGTTATC 850 

GTGTTGAACC ACCCCGGTCA GATCTCTGCC GGTTACTCTC CCGTCTTGGA 900 

CTGCCACACC GCCCACATTG CTTGCAAGTT CGCCGAGTTG ATCGAGAAGA 950 

40 TTGACAGACG TACCGGTAAG TCCATGGAGG CTAACCCCAA GTTCGTCAAG 1000 

TCTGGTGACG CCGCCATCGT CAAGATGGAG CCCACTAAGC CCATGTGTGT 1050 

TGAGGCCTTC ACTGACTTCC CTCCTTTGGG TAGATTCGCC GTCAGAGA 1098 



45 



55 



2) INFORMATION FOR SEQ ID NO: 414 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1102 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dubliniensis 

(B) STRAIN: NCPF 3949 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414 

249 
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CTTGGATAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAACATG ATCACTGGTA CTTCTCAAGC 150 

5 TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 200 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 250 

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 300 

ATGGGACAAA AACAGATTCG AAGAAATCAT CAAGGAAACC TCTAACTTCG 350 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 400 

10 GGTTGGAATG GTGACAACAT GATTGAAGCT TCCACCAACT GTCCATGGTA 450 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAGGTTACT GGTAAGACCT 500 

TGTTAGAAGC TATTGATGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550 

CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACTGGTGT CATTAAAGCC GGTATGGTTG 650 

15 TCACTTTTGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700 

CATCACGAAC AATTGGTTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

CGTTAAGAAT GTTTCTGTCA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

ATTGTCTTGA ACCATCCAGG TCAAATCTCT GCTGGTTAYT CTCCAGTCTT 900 

20 GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950 

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 1000 

AAATCCGGTG ACGCTGCTAT YGTCAAGATG GTCCCAACCA AACCAATGTG 1050 

TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 1100 

AT 1102 

25 



2) INFORMATION FOR SEQ ID NO: 415 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dubliniensis 
40 (B) STRAIN: CBS 7987 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415 

CTTGGATAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

45 CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAACATG ATCACTGGTA CTTCTCAAGC 150 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 200 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 250 

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 300 

50 ATGGGACAAA AACAGATTYG AAGAAATCAT CAAGGAAACC TCTAACTTCG 350 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 400 

GGTTGGAATG GTGACAACAT GATTGAAGCT TCCACCAACT GTCCATGGTA 450 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAGGTTACT GGTAAGACCT 500 

TGTTAGAAGC TATTGATGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550 

55 CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACTGGTGT CATTAAAGCC GGTATGGTTG 650 

TCACTTTTGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700 

CATCACGAAC AATTGGTTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

CGTTAAGAAT GTTTCTGTCA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

60 ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

250 
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ATTGTCTTGA ACCATCCAGG TCAAATCTCT 
GGATTGTCAC ACTGCCCACA TTGCTTGTAA 
AGATTGACAG AAGAACTGGT AAGAAATTGG 
AAATCCGGTG ACGCTGCTAT YGTCAAGATG 
5 TGTTGAAGCT TTCACTGACT ACCCACCATT 
AT 



PCT/CA00/01150 

GCTGGTTAYT CTCCAGTCTT 900 
ATTCGACACT TTGGTTGAAA 950 
AAGAAAATCC AAAATTCGTC 1000 
GTCCCAACCA AACCAATGTG 1050 
AGGTAGATTC GCTGTCAGAG 1100 

1102 



10 2) INFORMATION FOR SEQ ID NO: 416 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1094 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida famata 

(B) STRAIN: ATCC 628 94 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416 



AATTGAAGGC TGAAAGAGAA AGAGGTATCA CCATTGATAT CGCTTTATGG 50 

AAATTCGAAA CTCCAAAATA CCACGTTACC GTTATTGATG CTCCAGGTCA 10 0 

CAGAGATTTC ATCAAGAACA TGATTACTGG TACTTCTCAA GCTGATTGTG 150 

CTATTTTRAT TATTGCTGGT GGTGTCGGTG AATTCGAAGC CGGTATCTCT 20 0 

3 0 AAGGATGGTC AAACCAGAGA ACACGCTTTA TTGGCTTACA CCTTAGGTGT 250 

TAGACAATTG ATTGTTGCCG TCAACAAGAT GGACTCTGTT AAATGGGACA 300 

AGGCTAGATT CGAAGAAATC ATCAAGGAAA CCTCTAACTT CGTCAAGAAG 350 

GTTGGTTACA ACCCTAAGAC TGTTCCTTTC GTYCCAATTT CTGGATGGAA 4 00 

CGGTGACAAC ATGATTGAAG CCTCCACCAA CTGTCCATGG TACAAGGGTT 450 

35 GGGAAAAGGA AACCAAGGCT GGTAAATCTA CTGGTAAGAC TTTGTTAGAA 500 

GCCATTGATG CCATTGAACC ACCAACCAGA CCAACCGAAA AGCCATTGAG 550 

ATTACCATTA CAAGATGTCT ACAAGATCGG TGGTATTGGT ACTGTGCCAG 600 

TCGGTAGAGT TGAAACCGGT GTTATCAAGG GTGGTATGGT TGTTACCTTT 650 

GCCCCAGCCG GTGTCACTAC CGAAGTCAAA TCCGTTGAAA TGCACCACGA 700 

4 0 ACAATTAGCT GAAGGTGTTC CAGGTGACAA TGTTGGTTTC AACGTCAAGA 750 

ACGTTTCCGT TAAGGAAATC AGAAGAGGTA ACGTTTGTGG TGACTCCAAG 800 

AACGACCCAC CAAAGGGTGC TGAATCTTTC ACCGCTCAAG TTATTGTCTT 850 

GAACCACCCA GGTCARATCT CTGCTGGTTA CTCTCCAGTC TTAGATTGTC 900 

ACACCGCCCA CATTGCTTGT AAATTCGATG CTTTACTCGA AAAGATTGAC 950 

4 5 AGAAGATCCG GTAAGAAATT AGAAGACGAA CCAAAATTCG TCAAGTCCGG 1000 

TGATGCTGCT ATCGTCAAGA TGGTCCCAAC CAAACCAATG TGTGTTGAAG 1050 

CTTTCACTGA ATACCCACCA TTAGGTAGAT TCGCTGTTAG AGAT 1094 



50 



60 



2) INFORMATION FOR SEQ ID NO: 417 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1101 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

251 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida glabrata 

(B) STRAIN: ATCC 66032 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417 

TTTGGACAAG TTGAAGGCTG AAAGAGAAAG AGGTATCACT ATCGATATCG 50 

CTTTGTGGAA GTTCGAAACT CCAAAGTACC ACGTYACCGT TATCGATGCY 100 

CCAGGTCACA GAGATTTCAT CAAGAACATG ATTACTGGTA CTTCTCAAGC 150 

10 TGACTGTGCT ATCTTGATTA TTGCTGGTGG TGTCGGTGAA TTCGAAGCYG 200 

GTATCTCCAA GGATGGTCAA ACCAGAGAAC ACGCTCTATT GGCTTTCACC 250 

CTAGGTGTTA GACAATTGAT TGTYGCTGTC AACAAGATGG ACTCTGTCAA 300 

GTGGGATGAA TCCAGATTCG CTGAAATCGT TAAGGAAACC TCCAACTTCA 350 

TCAAGAAGGT CGGTTACAAC CCAAAGACTG TTCCATTCGT CCCAATCTCT 400 

15 GGTTGGAACG GTGACAACAT GATTGAAGCC ACCACCAACG CTTCCTGGTA 450 

CAAGGGTTGG GAAAAGGAAA CCAAGGCTGG TGTCGTCAAG GGTAAGACCT 500 

TGTTGGAAGC CATTGACGCT ATCGAACCAC CAACCAGACC AACTGACAAG 550 

CCATTGAGAT TGCCATTGCA AGATGTCTAC AAGATCGGTG GTATCGGTAC 600 

GGTGCCAGTC GGTAGAGTCG AAACCGGTGT CATCAAGCCA GGTATGGTTG 650 

2 0 TTACCTTCGC CCCAGCTGGT GTTACCACTG AAGTCAAGTC CGTTGAAATG 700 

CACCACGAAC AATTGACTGA AGGTTTGCCA GGTGACAACG TTGGTTTCAA 750 

CGTTAAGAAC GTTTCCGTTA AGGAAATCAG AAGAGGTAAT GTCTGTGGTG 800 

ACTCCAAGAA CGACCCACCA AAGGCTGCTG CTTCTTTCAA CGCTACCGTC 850 

ATTGTCTTGA ACCACCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTTTT 900 

25 GGACTGTCAC ACCGCCCACA TTGCTTGTAA GTTCGAAGAA TTGTTGGAAA 950 

AGAACGACAG AAGATCCGGT AAGAAGTTGG AAGACTCTCC AAAGTTCTTG 1000 

AAGTCCGGTG ACGCTGCTTT GGTTAAGTTC GTTCCATCCA AGCCAATGTG 1050 

TGTCGAAGCT TTCTCCGACT ACCCACCATT GGGTAGATTC GCTGTCAGAG 1100 

A 1101 

30 



2) INFORMATION FOR SEQ ID NO: 418 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1080 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida guilliermondii 
45 (B) STRAIN: ATCC 6260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418 

AGAGAAAGAG GTATCACCAT TGACATTGCT TTGTGGAAAT TCGAGACTCC 50 

50 AAAGTACCAC GTTACYGTCA TTGATGCCCC AGGTCACAGA GATTTCATCA 100 

AGAACATGAT CACTGGTACT TCTCAAGCTG ACTGTGCTAT TTTGATTATT 150 

GCTGGTGGTA CCGGTGAATT CGAAGCTGGT ATCTCTAAGG ATGGTCAAAC 200 

CAGAGAGCAC GCTTTGTTGG CTTACACCTT GGGTGTTAGA CAATTGATTG 250 

TTGCTGTCAA CAAGATGGAC TCCGTCAART GGGACAAGAA CAGATTYGAG 300 

55 GAAATCATCA AGGAAACCTC TAACTTCGTC AAGAAGGTTG GTTACAACCC 350 

TAAGACTGTG CCATTCGTTC CTATCTCTGG ATGGAAYGGT GACAACATGA 400 

TTGAGGCTTC TACCAACTGT CCTTGGTACA AGGGATGGGA GAAGGAGACC 450 

AAGGCTGGTA AGTCCACCGG TAAGACTTTG TTGGAGGCCA TTGACGCCAT 500 

TGAGCCACCT CAAAGACCAA CCGACAAGCC ATTGAGATTG CCATTGCAAG 550 

60 ATGTYTACAA GATTGGTGGT ATTGGAACGG TGCCAGTCGG TAGAGTTGAA 600 
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■ 

ACCGGTATCA TYAAGGCCGG TATGGTTGTT ACCTTTGCCC CAGCTGGTGT 650 

YACCACTGAA GTCAAGTCCG TGGAAATGCA CCACGAACAA TTGGTTGAAG 700 

GTGTTCCAGG TGACAATGTT GGTTTCAACG TTAAGAACGT TTCCGTTAAG 750 

GAAATTAGAA GAGGTAACGT TTGTGGTGAC TCCAAGAACG ACCCACCAAA 800 

GGGTTGTGAC TCTTTCACCG CTCAAGTTAT TGTGTTGAAC CACCCTGGTC 850 

AAATCTCTGC TGGTTACTCT CCAGTTTTGG ACTGTCACAC CGCCCACATT 900 

GCTTGTAAAT TCGACACCTT GTTGGAGAAG ATTGACAGAA GAACCGGTAA 950 

GAAGATGGAG GACAACCCCA AGTTTGTCAA GTCCGGTGAC GCTTCTATCG 1000 

TCAAGATGGT GCCATCCAAG CCAATGTGTG TTGAGGCTTT CACCGACTAC 1050 

CCACCATTGG GAAGATTCGC CGTCAGAGAC 1080 



2) INFORMATION FOR SEQ ID NO: 419 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida haemulonii 

(B) STRAIN: ATCC 22991 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419 



TCTGTCAAGT GGGACAAGGC CAGATACGAG 
TAACTTCGTC AAGAAGGTTG GTTACAACCC 
CAATCTCTGG TTGGAACGGT GACAACATGA 
GACTGGTACA AGGGTTGGGA GAAGGAGACC 
TAAGACCTTG TTGGAGGCCA TTGACGCCAT 
CCGACAAGCC ATTGAGATTG CCATTGCAGG 
ATCGGAACTG TGCCAGTCGG CAGAGTTGAG 
TATGGTTGTC ACCTTCGCCC CAGCTGGTGT 
TCGAGATGCA CCACGAGCAG TTGCCAGAGG 
GGTTTCAACG TCAAGAACGT TTCCGTTAAG 
CTGTGGTGAC TCCAAGCAGG ACCCACCAAA 
CTCAGGTTAT TGTGTTGAAC CACCCAGGTC 
CCAGTTTTGG ACTGTCACAC TGCCCACATT 
GGTTGAGAAG ATCGACAGAA GAACCGGTAA 
AGTTCATCAA GTCCGGTGAC GCTGCTATCG 
C 



GAAATCGTCA AGGAGACCTC 50 

TAAGACTGTT CCATTCGTCC 100 

TTGAGGCTTC TACCAACTGT 150 

AAGTCTGGTA AGTCCACCGG 2 00 

TGAGCCACCA ACCAGACCAA 250 

ATGTCTACAA GATTGGTGGT 3 00 

ACCGGTGTTA TCAAGGCCGG 350 

CACCACTGAA GTCAAGTCTG 4 00 

GTGTCCCAGG TGACAACGTT 450 

GAAATCAGAA GAGGTAACGT 500 

GGGCTGTGAC TCTTTCACCG 550 

AGATCTCTTC TGGTTACTCT 600 

GCTTGTAAGT TCGACACCTT 650 

GAAGTTGGAA GATGAGCCAA 7 00 

TCAAGATGGT CCCAACCAAG 750 

751 



2) INFORMATION FOR SEQ ID NO: 420 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida inconspicua 
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(B) STRAIN: ATCC 16783 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420 

5 TCTTGACAAG TTAAAGGCTG AAAGAGAAAG AGGTATCACT ATTGATATTG 50 

CTTTATGGAA ATTCGAAACT CCAAAGTATC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT TAAGAACATG ATTACTGGTA CTTCTCAAGC 150 

AGATTGTGCT ATTTTGATTA TTGCTGGTGG TGTCGGTGAA TTCGAAGCTG 2 00 

GTATTTCCAA GGATGGTCAA ACTAGAGAAC ACGCTTTATT AGCATTCACC 250 

10 TTAGGTGTTA AGCAATTGAT TGTTGCTATC AACAAGATGG ATTCTGTTAA 3 00 

GTGGGATGAA AAGAGATTTG AAGAAATTGT CAAGGAAACC CAAAACTTCA 350 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATTTCT 400 

GGTTGGAATG GTGACAACAT GATTGAACCA TCTTCTAACT GTCCATGGTA 450 

CAAGGGTTGG ACTAAGGAAA CCAAGGCAGG TGTTGTCAAG GGTAAGACCT 500 

15 TATTAGAAGC TATTGATGCT ATTGAACCAC CTGTCAGACC AACTGATAAG 550 

CCATTAAGAT TACCATTACA AGATGTTTAC AAGATTGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACCGGTAT TATTAAGCCA GGTATGGTTG 650 

TTGTTTTCGC ACCATCTGGT GTTACCACTG AAGTCAAGTC CGTTGAAATG 700 

CACCATGAAC AATTAGAAGA AGGTGTCCCA GGTGACAATG TTGGTTTCAA 750 

20 CGTCAAGAAC GTCTCTGTTA AGGATATCAA GAGAGGTAAC GTTTGTGGTG 800 

ACTCCAAGAA TGACCCACCA CAAGGTTGTG CTTCCTTCAA TGCTCAAGTC 850 

ATTGTCTTGA ACCACCCTGG TCAAATTTCT GCTGGTTACT CTCCAGTTTT 900 

AGATTGTCAC ACTGCCCACA TTGCATGTAA ATTCGATGAA TTAATTGAAA 950 

AGATTGACAG AAGAACTGGT AAGTCCGTTG AAGACCATCC AAAGTCTGTT 1000 

25 AAGTCTGGTG ATGCAGCTAT CGTTAAGATG GTTCCAACCA AGCCAATGTG 1050 

TGTTGAAGCT TTCACTGAAT ACCCACCATT AGGTAGATTC GCAGTCAGAG 1100 

AT 1102 



30 



40 



2) INFORMATION FOR SEQ ID NO: 421 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1099 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida kefyr 

(B) STRAIN: ATCC 28838 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421 

TGGACAAGTT AAAGGCTGAA AGAGAAAGAG GTATCACCAT CGATATCGCT 50 

TTGTGGAAGT TCGAAACTCC AAAGTACCAA GTTACCGTTA TCGATGCTCC 100 

AGGTCACAGA GATTTCATCA AGAACATGAT TACTGGTACT TCTCAAGCTG 150 

50 ACTGTGCTAT CTTGATTATT GCTGGTGGTG TCGGTGAATT CGAAGCCGGT 2 00 

ATCTCCAAGG ATGGTCAAAC CAGAGAACAC GCTTTGTTGG CTTACACCTT 250 

GGGTGTTAGA CAATTGATTG TTGCTATCAA CAAGATGGAC TCTGTTAAGT 3 00 

GGGATGAATC TCGTTACCAA GAAATTGTTA AGGAAACCTC CAACTTCATC 350 

AAGAAGGTCG GTTACAACCC AAAGAATGTT CCATTCGTCC CAATCTCTGG 400 

55 TTGGAACGGT GACAACATGA TTGAAGCCAC CACCAACGCT CCATGGTACA 4 50 

AGGGTTGGGA AAAGGAAACC AAGGCTGGTA CCGTCAAGGG TAAGACCTTG 500 

TTGGAAGCTA TTGACGCTAT CGAACCACCA ACCAGACCAA CTGACAAGCC 550 

ATTGAGATTG CCATTGCAAG ATGTCTACAA GATCGGTGGT ATTGGTACTG 600 

TGCCAGTCGG TAGAGTCGAA ACCGGTGTCA TCAAGCCAGG TATGGTTGTT 65 0 

60 ACCTTCGCCC CAGCCGGTGT CACTACCGAA GTTAAGTCCG TCGAAATGCA 700 
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CCACGAACAA TTGGAAGAAG GTCTACCAGG TGACAACGTC GGTTTCAACG x 750 

TCAAGAACGT TTCCGTTAAG GAAATCAGAA GAGGTAACGT CTGTGGTGAC 800 

TCCAAGAACG ATCCACCAAA GGCTGCTGCT TCTTTCAACG CCACTGTTAT 850 

CGTCTTGAAC CACCCAGGTC AAATCTCTGC TGGTTACTCT CCAGTTTTGG 900 

5 ATTGTCACAC TGCTCACATT GCTTGTAAGT TCGACGAATT GTTGGAAAAG 950 

AACGACAGAA GATCCGGTAA GAAGTTGGAA GACTCTCCAA AGTTCTTGAA 1000 

GTCTGGTGAC GCTGCTTTGG TTAAGTTCGT TCCATCTAAG CCAATGTGTG 1050 

TTGAAGCATT CTCTGACTAC CCACCATTGG GTAGATTCGC TGTCAGAGA 1099 



2) INFORMATION FOR SEQ ID NO: 422 



(i) SEQUENCE CHARACTERISTICS : 
15 (A) LENGTH: 1095 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



2 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida krusei 

(B) STRAIN: ATCC 34135 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422 

AAGTTAAAGG CAGAAAGAGA AAGAGGTATC ACTATTGATA TTGCTTTATG 50 

GAAGTTYGAA ACTCCAAART ACCACGTTAC CGTTATTGAT GCTCCAGGTC 10 0 

3 0 ACAGAGATTT CATCAAGAAC ATGATTACCG GTACTTCTCA AGCTGATTGT 150 

GCTATTTTGA TTATTGCTGG TGGTGTCGGT GAATTCGAAG CTGGTATCTC 200 

CAAGGATGGT CAAACTAGAG AACACGCTCT ATTGGCTTTC ACCTTAGGTG 250 

TTAGACAATT GATTGTTGCT ATCAACAAGA TGGATTCCGT TAARTGGGAT 300 

GAAAACAGAT TTGAAGAAAT TGTCAAGGAA ACCCAAAACT TCATCAAGAA 350 

35 GGTTGGTTAC AACCCAAAGA CTGTTCCATT CGTTCCAATY TCTGGTTGGA 400 

ATGGTGACAA CATGATTGAA GCATCCACCA ACTGTCCATG GTACAAGGGT 450 

TGGACTAAGG AAACCAAGGC AGGTGTTGTT AAGGGTAAGA CCTTATTAGA 50 0 

AGCAATCGAT GCTATTGAAC CACCTGTCAG ACCAACCGAA AAGCCATTAA 550 

GATTACCATT ACAAGATGTT TACAAGATTG GTGGTATTGG TACTGTGCCA 600 

40 GTCGGTAGAG TCGAAACCGG TGTCATTAAG CCAGGTATGG TTGTCACTTT 650 

TGCTCCAGCA GGTGTCACCA CCGAAGTCAA GTCCGTTGAA ATGCACCATG 700 

AACAATTAGA ACAAGGTGTT CCAGGTGATA ACGTTGGTTT CAACGTTAAG 750 

AACGTTTCTG TCAAGGATAT CAAGAGAGGT AACGTTTGTG GTGACTCCAA 800 

GAACGACCCA CCAATGGGTG CAGCTTCCTT CAATGCTCAA GTCATTGTCT 850 

45 TGAACCACCC TGGTCAAATT TCCGCTGGTT ACTCTCCAGT CTTGGATTGT 900 

CACACTGCCC ACATTGCATG TAAGTTCGAC GAATTAATCG AAAAGATTGA 950 

CAGAAGAACT GGTAAGTCTG TTGAAGACCA TCCAAAGTCY GTCAAGTCTG 1000 

GTGATGCAGC TATCGTCAAG ATGGTCCCAA CCAAGCCAAT GTGTGTTGAA 1050 

GCTTTCACTG AATAYCCACC ATTAGGTAGA TTCGCAGTCA GAGAT 1095 

50 



2) INFORMATION FOR SEQ ID NO: 423 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1104 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

60 
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PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida Iambic a 
5 (B) STRAIN: ATCC 24750 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423 

CTTGGACAAG CTTAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

10 CTTTATGGAA GTTCGAAACT CCAAAGTACC ACGTTACCGT CATTGACGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAACATG ATTACTGGTA CCTCTCAAGC 150 

AGATTGTGCT ATTTTRATYA TTGCTGGTGG TGTCGGTGAA TTCGAAGCTG 200 

GTATCTCTAA GGATGGTCAA ACCAGAGAAC ACGCTCTTCT TGCATTCACT 250 

CTTGGTGTTA GACAATTGAT TGTTGCTATC AACAAGATGG ACTCTGTCAA 300 

15 GTGGGACGAA TCCAGATTCG ATGAAATTTG TAAGGAAACC GCWAACTTCA 350 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT CCCAATCTCT 400 

GGTTGGAACG GTGACAACAT GATTGAACCA TCTGCTAACT GTCCATGGTA 450 

CAAGGGATGG ACTAAGGAAA CCAAGGCTTC CGGTGTCGTC AAGGGTAAGA 500 

CCCTTCTTGA AGCAATTGAT GCTATTGAGC CACCTGTCAG ACCAACTGAC 550 

2 0 AAGGCTTTGA GATTGCCATT RCAAGATGTC TACAAGATTG GTGGTATTGG 600 

TACTGTGCCA GTCGGTAGAG TTGAAACCGG TATCAT CAAG CCAGGTATGA 650 

TTGTCGTTTT CGCTCCAACC GGTGTTACTA CTGAAGTTAA GTCCGTTGAA 700 

ATGCACCATG AACAATTAGA AGAAGGTGTC CCAGGTGACA ATGTTGGTTT 7 50 

CAACGTCAAG AACGTCTCTG TTAAGGATAT TAAGAGAGGT AACGTCTGTG 800 

25 GTGACTCCAA GAACGACCCA CCAATGGGTT GTGCTTCCTT CAATGCTCAA 850 

GTCATTGTTC TTAACCACCC AGGTCAAATT TCTGCTGGTT ACTCACCAGT 900 

TCTTGACTGT CACACTGCCC ACATTGCATG TAAGTTCGAY GAATTACTCG 950 

AAAAGATTGA CAGAAGAACC GGTAAGGCTA CTGAAGACCA TCCAAAGTCT 1000 
GTCAAGTCTG GTGATGCAGC TATCGTCAAG ATGGTTCCAA CCAAGCCAAT ' 1050 

3 0 GTGTGTYGAA GCTTTCACTG ACTACCCACC ATTAGGTAGA TTCGCTGTYA 1100 

GAGA 1104 



3 5 2 ) INFORMATION FOR SEQ ID NO: 424 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida lusitaniae 

(B) STRAIN: ATCC 66035 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424 



GGACAAGTTG AAGGCTGAGA GAGAAAGAGG TATCACCATC GATATCGCTT 50 

TGTGGAAGTT CGAGACTCCA AAGTACCACG TTACCGTCAT TGACGCTCCA 100 

GGTCACAGAG ATTTCATCAA GAACATGATC ACTGGTACTT CCCAAGCTGA 150 

CTGTGCTATC TTGATTATCG CTGGTGGTGT CGGTGAGTTC GAAGCCGGTA 2 00 

55 TCTCTAAGGA CGGTCAAACC AGAGAGCACG CTTTGTTGGC TTACACCTTG 250 

GGTGTCAAGC AGTTGATTGT TGCTGTCAAC AAGATGGACT CCGTCAAGTG 3 00 

GGACCAATCT AGATTCGAGG AAATCATCAA GGAAACCTCT AACTTCGTCA 3 50 

AGAAGGTTGG TTACAACCCT AAGACTGTTC CATTCGTCCC AATCTCTGGT 4 00 

TGGAACGGTG ACAACATGAT TGAGCCATCY ACCAACTGCC CATGGTACAA 450 

60 GGGTTGGGAG AAGGAGACCA AGTCYGGTAA GTCCACCGGT AAGACCTTGT 500 
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TGGAGGCCAT TGACGCCATT GAGCCACCTT CGAGACCAAC CGACAAGCCX 55 & 

TTGAGATTGC CATTGCAAGA TGTCTACAAG ATYGGTGGTA TTGGTACTGT 600 

GCCAGTCGGT AGAGTTGAGA CCGGTGTCAT CAAGGCCGGT ATGGTTGTCA 650 

CCTTTGCTCC AGCTGGTGTC ACCACTGAAG TCAAGTCCGT GGAAATGCAC 700 

5 CACGAACAAT TGGCTGAGGG TGTCCCAGGT GACAACGTTG GTTTCAACGT 750 

CAAGAACGTT TCCGTCAAGG AAATCAGAAG AGGTAACGTC TGTGGTGACT 800 

CCAAGAACGA CCCACCAAAG GCTGCTGCTT CYTTCACTGC TCAAGTYATY 850 

GTCTTGAACC ACCCAGGTCA AATCTCCTCY GGTTACTCTC CAGTYTTGGA 900 

CTGTCACACT GCYCACATTG CTTGTAAGTT CGACACCTTG ATTGAGAAGA 950 

10 TCGACAGAAG AACYGGTAAG AAGTTGGAAG AAGAGCCAAA GTTCATCAAG 1000 

TCYGGTGACG CTGCTATCGT CAAGATGGTC CCAACCAAGC CAATGTGYGT 1050 

YGAAGCTTTC ACCGACTACC CACCATTGGG TAGATTCGCT GTCAGAGA 1098 



15 



2) INFORMATION FOR SEQ ID NO: 425 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1101 bases 
2 0 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida norvegensis 

(B) STRAIN: ATCC 22977 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425 

TCTTGACAAG TTAAAGGCTG AAAGAGAAAG AGGTATCACT ATTGATATTG 50 

CTTTATGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT TATTGATGCT 100 

CCAGGTCACA GAGATTTCAT TAAGAACATG ATTACTGGTA CTTCCCAAGC 150 

35 TGATTGTGCT ATCTTAATTA TTGCTGGTGG TGTCGGTGAA TTCGAAGCTG 2 00 

GTATCTCCAA GGATGGTCAA ACCAGAGAAC ACGCTTTATT AGCATTCACC 250 

TTAGGTGTTA AGCAATTAAT TGTTGCTATC AACAAGATGG ACTCTGTTAA 300 

GTGGGATGAA AAGAGATTTG AAGAAATTGT CAAGGAAACC CAAAACTTCA 350 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATTTCT 400 

40 GGTTGGAATG GTGACAACAT GATTGAACCA TCTACTAACT GTCCATGGTA 450 

CAAGGGTTGG ACTAAGGAAA CCAAGGCAGG TGTTGTTAAG GGTAAGACCT 500 

TATTAGAAGC TATTGATGCT ATTGAACCAC CTGTCAGACC AACTGACAAG 550 

CCATTAAGAT TACCATTACA AGATGTTTAC AAGATTGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACCGGTGT TATTAAGCCA GGTATGGTTG 650 

45 TTGTTTTCGC ACCATCTGGT GTTACCACTG AAGTCAAGTC CGTTGAAATG 700 

CACCATGAAC AATTAGAACA AGGTGTCCCA GGTGACAATG TTGGTTTCAA 750 

CGTCAAGAAC GTCTCTGTTA AGGATATTAA GAGAGGTAAC GTTTGTGGTG 800 

ACTC C AAGAA CGACCCACCA CAAGGTTGTG CTTCCTTCAA TGCTCAAGTC 850 

ATTGTCTTGA ACCACCCTGG TCAAATTTCT GCAGGTTACT CTCCAGTTTT 900 

50 AGATTGTCAC ACTGCCCACA TTGCATGTAA GTTCGATGAA TTAATCGAAA 950 

AGATTGACAG AAGAACTGGT AAGTCCGTTG AAGACCATCC AAAGTCTGTT 1000 

AAGTCTGGTG ATGCAGCTAT CGTTAAGATG GTTCCAACCA AGCCAATGTG 1050 

TGTTGAAGCT TTCACTGAAT ACCCACCATT AGGTAGATTC GCAGTCAGAG 1100 

A 1101 

55 

2) INFORMATION FOR SEQ ID NO: 426 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1095 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida parapsilosis 
10 (B) STRAIN: ATCC 90018 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426 

CAAATTGAAG GCTGAAAGAG AAAGAGGTAT CACCATTGAT ATCGCTTTGT 50 

15 GGAAATTCGA AACTCCAAAA TACCATGTTA CTGTTATTGA TGCTCCAGGT 100 

CACAGAGATT- TCATCAAGAA TATGATTACT GGTACTTCTC AAGCTGATTG 150 

TGCTATTTTG ATTATTGCTG GTGGTACTGG TGAATTCGAA GCTGGTATCT 200 

CTAAGGATGG TCAAACCAGA GAACACGCTT TGTTGGCTTA CACCTTGGGT 250 

GTTAAGCAAT TGATTGTTGC CATCAACAAG ATGGACTCAG TCAAATGGGA 300 

20 CAAGAACAGA TACGAAGAAA TTGTCAAGGA AACTTCCAAC TTCGTCAAGA 350 

AGGTTGGTTA CAACCCTAAA GCTGTCCCAT TCGTCCCAAT CTCTGGTTGG 400 

AACGGTGACA ATATGATTGA ACCATCAACC AACTGTCCAT GGTACAAGGG 450 

TTGGGAAAAG GAAACTAAAG CTGGTAAGGT TACCGGTAAG ACCTTGTTGG 500 

AAGCTATCGA TGCTATCGAA CCACCAACCA GACCAACTGA CAAGCCATTG 550 

25 AGATTGCCAT TGCAAGATGT CTACAAGATT GGTGGTATTG GAACTGTGCC 600 

AGTTGGTAGA GTTGAAACCG GTATCATCAA GGCTGGTATG GTTGTTACTT 650 

TTGCCCCAGC TGGTGTTACC ACTGAAGTCA AGTCCGTTGA AATGCACCAC 700 

GAACAATTGA CTGAAGGTGT CCCAGGTGAC AATGTTGGTT TCAACGTCAA 750 

GAACGTTTCA GTTAAGGAAA TCAGAAGAGG TAACGTTTGT GGTGACTCCA 800 

30 AGAACGATCC ACCAAAGGGA TGTGAYTCCT TCAATGCTCA AGTTATTGTC 850 

TTGAACCACC CAGGTCAAAT CTCTGCTGGT TACTCACCAG TCTTGGATTG 900 

TCACACTGCC CACATTGCTT GTAAATTCGA CACTTTGATT GAAAAGATTG 950 

ACAGAAGAAC CGGTAAGAAA TTGGAAGATG AACCAAAATT CATCAAGTCC 1000 

GGTGATGCTG CYATCGTCAA GATGGTCCCA AC C AAG CCAA TGTGTGTTGA 1050 

35 AGCTTTCACT GACTACCCAC CATTGGGAAG ATTCGCTGTT AGAGA 1095 



2) INFORMATION FOR SEQ ID NO: 427 



40 



( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 752 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Candida rugosa 

(B) STRAIN: ATCC 96275 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427 

55 CTCCGTCAAG TGGTCTCAGT CTCGTTTCGA GGAGATCGTC AAGGAGGTTT 50 

CCAACTTCAT CAAGAAGGTT GGTTACAAGC CCGATGAGGT TCCTTTCGTC 100 

CCCATCTCTG GCTGGAACGG CGACAACATG CTTGAGCCCT CCACCAACTG 150 

CCCCTGGTAC AAGGGATGGA CCAAGAAGAC CAAGAAGGGT GAGGTCAAGG 200 

GTAAGACTCT TCTCGAGGCC ATTGACGCCA TCGAGCCCCC CTCCCGTCCT 250 

60 ACCGACAAGC CCCTCCGCTT GCCTCTTCAG GATGTCTACA AGATCGGCGG 3 00 

258 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

TATCGGTACG GTACCTGTCG GCCGTGTCGA 
GCATGGTCGT CACTTTCGCC CCCGCTGGTG 
GTCGAGATGC ACCACGAGCA GATCCCCGAG 
CGGTTTCAAC GTCAAGAACG TTACCGTCAA 
TCTGCGGTGA CTCCAAGAAC GACCCCCCCA 
GCCCAGGTCA TCGTTTTCAA CCACCCCGGT 
CCCCGTTTTG GACTGCCACA CCGCCCACAT 
TCCAGTCCAA GATGGACCGT CGTACTGGTA 
AAGTTCATCA AGGCTGGTGA CTCCGCTATC 
GC 



PCT/CA00/01150 

GACCGGTATC ATCAAGCCCG 350 

TCACCACTGA AGTGAAGTCC 400 

GGTCTCCCCG GTGACAACGT 450 

GGATATCCGC CGTGGTAACG 500 

AGGGCTGCTC TTCCTTCACT 550 

CAGATCTCCA ACGGTTACTC 600 

TGCCTGCCGC TTCGACGAGA 650 

AGACCCTTGA GGAGAACCCC 700 

GTCAAGATGG TTCCCTCCAA 750 

752 



2) INFORMATION FOR SEQ ID NO: 42 8 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1093 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida sphaerica 

(B) STRAIN: ATCC 2504 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428 

AGTTAAAGGC TGAAAGAGAA AGAGGTATCA CCATCGATAT CGCTTTGTGG 50 

AAGTTCGAAA CTCCAAAGTA CCAAGTTACC GTTATCGATG CTCCAGGTCA 100 

CAGAGATTTC ATCAAGAACA TGATTACTGG TACTTCTCAA GCTGACTGTG 150 

CTATCTTGAT TATTGCTGGT GGTGTCGGTG AATTCGAAGC CGGTATCTCC 2 00 

AAGGATGGTC AAACCAGAGA ACACGCTTTG TTGGCTTTCA CCTTGGGTGT 250 

TAGACAATTG ATTGTTGCTG TTAACAAGAT GGATTCCGTT AAGTGGGATG 300 

AATCTCGTTT CCAAGAAATT GTCAAGGAAA CCTCTAACTT CATCAAGAAG 350 

GTCGGTTACA ACCCAAAGAC TGTTCCATTC GTCCCAATCT CTGGTTGGAA 4 00 

CGGTGACAAC ATGATTGAAG CCACCACCAA TGCTTCATGG TACAAGGGTT 450 

GGGAAAAGGA AACCAAGTCC GGTGTCGTCA AGGGTAAGAC CTTGTTGGAA 500 

GCTATTGACG CTATCGAACC ACCATCCAGA CCAACTGACA AGCCATTGAG 550 

ATTGCCATTG CAAGATGTCT ACAAGATTGG TGGTATCGGA ACTGTGCCAG 600 

TCGGTAGAGT CGAAACCGGT GTTATCAAGC CAGGTATGAT TGTTACCTTT 650 

GCCCCAGCCG GTGTTACTAC TGAAGTTAAG TCCGTCGAAA TGCACCACGA 700 

ACAATTGGAA GAAGGTCTAC CAGGTGACAA CGTCGGTTTC AACGTCAAGA 750 

ACGTTTCCGT TAAGGAAATC AGAAGAGGTA ACGTCTGTGG TGACTCCAAG 800 

AACGATCCAC CAAAGGCTGC TGCTTCTTTC AACGCCACTG TTATCGTCTT 850 

GAACCATCCA GGTCAAATCT CTGCTGGTTA CTCTCCAGTT TTGGATTGTC 900 

ACACTGCTCA CATTGCTTGT AAGTTCGACG AATTGTTGGA AAAGAACGAT 950 

AGAAGATCCG GTAAGAAGTT GGAAGACTCT CCAAAGTTCT TGAAGTCCGG 1000 

TGATGCTGCT TTGGTTAAGT TCGTTCCATC TAAGCCAATG TGTGTTGAAG 1050 

CCTTCTCTGA CTACCCACCT CTAGGTAGAT TCGCTGTCAG AGA 1093 



2) INFORMATION FOR SEQ ID NO: 429 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1094 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

259 



SUBSTITUTE SHEET (RULE 2S) 



WO 01/23604 PCT/C A00/01 150 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida tropicalis 

(B) STRAIN: ATCC 13803 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429 

AAATTGAAGG CTGAAAGAGA AAGAGGTATC ACCATTGATA TCGCTTTGTG 50 

GAAATTCGAA AC TCCAAAAT ACCACGTTAC CGTTATTGAT GCTCCAGGTC 100 

ACAGAGATTT CATCAAGAAC ATGATTACTG GTACTTCCCA AGCTGATTGT 150 

GCTATTTTGA TTATTGCTGG TGGTACTGGT GAATTCGAAG CTGGTATTTC 200 

TAAAGATGGT CAAACCAGAG AACACGCTTT GTTGGCTTAC ACCTTGGGTG 250 

TCAAACAATT GATTGTTGCT GTCAACAAGA TGGACTCTGT TAAATGGGAC 3 00 

AAAAACAGAT TTGAAGAAAT TATCAAGGAA ACTTCTAACT TCGTCAAGAA 3 50 

GGTTGGTTAC AACCCTAAGG CTGTTCCATT CGTTCCAATC TCTGGTTGGA 400 

ATGGTGACAA CATGATTGAA GCTTCTACCA ACTGTCCATG GTACAAGGGT 450 

TGGGAAAAAG AAACCAAGGC TGGTAAGGTT ACCGGTAAGA CTTTGTTGGA 500 

AGCCATTGAT GCTATTGAAC CACCTTCAAG ACCAACTGAC AAGC CATTGA 550 

GATTGCCATT GCAAGATGTT TACAAGATTG GTGGTATTGG TACTGTGCCA 600 

GTCGGTAGAG TTGAAACTGG TGTCATCAAA GCCGGTATGG TTGTTACTTT 650 

CGCCCCAGCT GGTGTTACCA CTGAAGTCAA ATCCGTCGAA ATGCACCACG 700 

AACAATTGGC TGAAGGTGTC CCAGGTGACA ATGTTGGTTT CAACGTTAAG 750 

AACGTTTCTG TTAAAGAAAT TAGAAGAGGT AACGTTTGTG GTGACTCCAA 800 

GAACGATCCA CCAAAGGGTT GTGACTCTTT CAACGCTCAA GTTATTGTCT 850 

TGAACCACCC AGGTCAAATC TCTGCTGGTT ACTCTCCAGT CTTGGATTGT 900 

CACACTGCTC ATATTGCTTG TAAATTCGAC ACCTTGGTTG AAAAGATTGA 950 

CAGAAGAACT GGTAAGAAAT TGGAAGAAAA TCCAAAATTC GTCAAATCCG 1000 

GTGATGCTGC TATTGTCAAG ATGGTTCCAA CCAAACCAAT GTGTGTTGAA 1050 

GCTTTCACTG ACTACCCACC ATTAGGTAGA TTCGCTGTCA GAGA 1094 



2) INFORMATION FOR SEQ ID NO: 43 0 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1095 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida utilis 

(B) STRAIN: Csp 388 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430 

CAAGCTTAAA GCTGAGAGAG AGAGAGGTAT CACTATCGAC ATTGCTCTCT 50 

GGAAGTTCGA GACTCCAAAG TACCACGTTA CTGTCATTGA TGCCCCAGGT 100 

CACAGAGATT TCATCAAGAA CATGATTACT GGTACCTCCC AGGCTGACTG 150 

TGCTATTCTT ATCATTGCCG GTGGTGTTGG TGAGTTCGAG GCTGGTATCT 200 

CTAAGGATGG TCAGACCAGA GAGCACGCTT TGCTCGCTTT CACCCTTGGT 250 

GTTAGACAGA TGATTGTTGC TATCAACAAG ATGGACTCTG TCAAGTGGGA 300 

CGAGAAGAGA TTCGAGGAGA TCGTTAAGGA GACCTCTAAC TTCATCAAGA 350 

AGGTTGGTTA CAACCCAAAG ACTGTTCCAT TTGTCCCAAT TTCYGGTTGG 400 

AACGGTGACA ACATGATTGA GGCCTCTACC AACTGTCCAT GGTACAAGGG 450 

260 
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TTGGGAGAAG GAGACCAAGG CTGGTGTTGT CAAGGGTAAG ACCTTGCTCG ' &6 

ATGCCATTGA CGCCATTGAG CCACCAACAA GACCAACTGA CAAGCCATTG 550 

AGATTGCCAC TCCAGGATGT CTACAAGATT GGTGGTATCG GAACTGTTCC 600 

AGTCGGTAGA GTCGAGACCG GTGTCATCAA GCCAGGTATG GTTGTTACCT 650 

5 TTGCCCCATC CGGTGTCACC ACTGAGGTTA AGTCCGTCGA GATGCACCAC 700 

GAGCAGCTTG CTGAGGGTAT CCCAGGTGAC AACGTTGGTT TCAACGTTAA 750 

GAACGTCTCT GTTAAGGAGA TCAGAAGAGG TAACGTTGCC GGTGACTCCA 800 

AGAACGACCC ACCACAGGGT GCTGAGTCCT TCAACGCTCA GGTCATTGTC 850 

TTGAACCACC CAGGTCAGAT CTCTGCTGGT TACTCTCCAG TTTTGGACTG 900 

10 TCACACCGCC CACATTGCTT GTAAGTTCTC TGAGCTTTTG GAGAAGATTG 950 

ACAGAAGATC CGGTAAGTCC CTTGAGGCCT CTCCAAAGTT CGTCAAGTCT 1000 

GGTGATGCCG CTATCGTCAA GATGGTTCCA TCCAAGCCAT TGTGTGTTGA 1050 

GGCCTTCACT GACTACCCAC CACTCGGTAG ATTCGCTGTC AGAGA 1095 



15 



30 



2) INFORMATION FOR SEQ ID NO: 431 



( i ) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 1085 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida viswanathii 

(B) STRAIN: ATCC 28269 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 431 



GCTGAAAGAG AAAGAGGTAT CACCATCGAT ATCGCTTTGT GGAAATTCGA 50 

AACTCCAAAR TACCACGTTA CCGTCATTGA YGCTCCAGGT CACAGAGATT 100 

35 TCATCAAGAA CATGATYACT GGTACTTCTC AAGCTGATTG TGCTATYTTG 150 

ATTATCGCTG GTGGTACTGG TGAATTCGAA GCTGGTATYT CTAAGGATGG 2 00 

TCAAACCAGA GAACACGCTT TGTTGGCCTA CACCTTGGGT GTCAAGCAAT 250 

TGATTGTTGC TGTCAACAAG ATGGACTCTG TCAAATGGGA CAAGAACAGA 3 00 

TTCGAAGAAA TCATCAAGGA AACCTCCAAC TTCGTCAAGA AGGTTGGTTA 350 

40 CAACCCAAAG ACTGTTCCAT TCGTCCCAAT CTCTGGTTGG AACGGTGACA 400 

ACATGATTGA AGCCTCCACC AACTGCCCAT GGTACAAGGG TTGGGAAAAG 450 

GAAACCAAGG CTGGTAAGGT TACCGGTAAG ACTTTGTTGG AAGCCATTGA 500 

CGCTATCGAA CCACCAACCA GACCAACTGA CAAGCCATTG AGATTGCCAT 550 

TGCAAGATGT CTACAAGATT GGTGGTATCG GAACTGTGCC AGTCGGTAGA 600 

45 GTTGAAACTG GTGTCATCAA GGCCGGTATG GTTGTCACTT TYGCCCCAGC 650 

TGGTGTTACC ACTGAAGTCA AGTCCGTTGA AATGCACCAC GAACAATTGG 700 

CTGAAGGTGT CCCAGGTGAC AACGTTGGTT TCAACGTCAA GAACGTTTCC 750 

GTCAAGGAAA TCAGAAGAGG TAACGTCTGT GGTGACTCCA AGAACGACCC 800 

ACCAAAGGGT TGTGASTCTT TCAACGCTCA AGTCATTGTC TTGAACCACC 850 

50 CAGGTCAAAT CTCTGCTGGT TACTCTCCAG TCTTGGATTG TCACACTGCC 900 

CACATTGCTT GTAAGTTTGA CACCTTGGTT GAAAAGATTG ACAGAAGAAC 950 

CGGTAAGAAG TTGGAAGAAA ACCCAAAGTT TGTCAAGTCC GGTGACGCTG 1000 

CTATCGTCAA GATGGTCCCA ACCAAGCCAA TGTGTGTTGA AGCYTTCACT 1050 

GACTACCCAC CATTGGGTAG ATTCGCTGTC AGAGA 1085 

55 



2) INFORMATION FOR SEQ ID NO: 432 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1072 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida zeylanoides 

(B) STRAIN: ATCC 7351 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432 

AGGTATTACC ATTGACATTG CCTTGTGGAA GTTCGAGACC CCCAAGTACC 50 

AGGTCACCGT CATTGACGCT CCTGGCCACA GAGATTTCAT TAAGAACATG .100 

ATCACTGGTA CCTCCCAGGC TGACTGTGCC ATCTTGATCA TTGCTGGTGG 150 

TGTTGGTGAG TTCGAGGCTG GTATCTCCAA GGATGGCCAG ACCAGAGAGC 200 

ACGCCTTGCT TGCCTACACC TTGGGTGTCA AGCAATTGAT TGTTGCTGTC 250 

AACAAGATGG ACTCCGTCAA GTGGGACAAG AACAGATTCG AGGAGATTGT 300 

CAAGGAGACC TCCAACTTCG TCAAGAAGGT TGGCTACAAC CCCAAGACTG 350 

TCCCCTTCGT TCCCATCTCC GGTTGGAACG GTGACAACAT GATTGAGGCC 400 

TCCACCAACT GCCCTTGGTA CAAGGGTTGG GAGAAGGAGA CCAAGGCCGG 450 

TAAGGTCACT GGTAAGACCT TGTTGGAGGC TATTGACGCC ATTGAGCCCC 500 

CCACCAGACC CACCGACAAG CCCTTGAGAT TGCCCTTGCA GGATGTCTAC 550 

AAGATTGGTG GTATTGGAAC GGTGCCCGTT GGCAGAGTTG AGACCGGCAT 600 

CATCAAGGCC GGTATGGTTG TCACCTTTGC CCCCGCTGGT GTCACTACTG 650 

AAGTGAAGTC TGTCGAGATG CACCACGAGC AATTGGCTGA GGGTGTCCCA 700 

GGTGACAATG TTGGTTTCAA CGTGAAGAAC GTTTCCGTTA AGGAGATCAG 750 

AAGAGGTAAC GTTTGCGGTG ACTCCAAGAA CGACCCCCCC AAGGCTGCTG 800 

CTTCTTTCAA CGCCCAGGTT ATCGTCTTAA ACCACCCCGG TCAAATCTCT 850 

GCTGGTTACT CTCCGGTTTT GGATTGCCAC ACTGCCCACA TTGCTTGCAG 900 

ATTCGACCAG TTGATTGAGA AGATCGACAG AAGAACCGGT AAGAAGATGG 950 

AGGACGACCC TAAGTTCATC AAGTCCGGTG ACGCTGCCAT CGTCAAGATG 1000 

GTTCCTTCCA AGCCCATGTG TGTTGAGGCC TTCACTGACT ACCCTCCCTT 1050 

GGGTCGTTTC GCTGTCAGAG AC 1072 



2) INFORMATION FOR SEQ ID NO: 433 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coccidioides immitis 

(B) STRAIN: Silveira 



(xi) SEQUENCE DESCRIPTION: SEQ ID 



NO: 433 



AGCACCAACT GGTCCGAGCC TCGTTTCAAC GAAATCGTCA 
CAACTTCATC AAGAAGGTCG GATACAACCC CAAGGCTGTT 
CCATCTCTGG TTTCGAAGGT GACAACATGA TTCAACCCTC 
CCTTGGTACA AGGGCTGGAA CAAGGAGACC GCCTCTGGCA 
CAAGACCCTC CTCGACGCCA TTGATGCCAT CGACCCCCCA 
CCGAGAAGCC CCTCCGTCTC CCACTTCAGG ATGTGTACAA 



AGGAAGTCTC 
CCATTCGTCC 
CACCAACGCT 
AGCACACTGG 
ACCCGCCCCA 
GATCTCTGGT 



50 
100 
150 
200 
250 
300 



262 
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ATCGGAACAG TCCCAGTCGG CCGTGTCGAA 
TATGGTTGTG ACCTTCGCTC CTTCCAACGT 
TCGAAATGCA CCACCAGCAG CTCACCCAGG 
GGCTTCAACG TCAAGAACGT CTCTGTCAAG 
5 CGCTGGTGAC TCCAAGAACG ACCCACCAAA 
CCCAGGTCAT CGTCCTCAAC CACCCTGGTC 
CCAGTCCTTG ACTGCCACAC TGCCCACATT 
CCTCGAGAAG ATCGACCGCC GTACCGGTAA 
AGTTCATCAA GTCTGGTGAT GCCGCTATCG 



PCT/CA00/01150 

ACCGGTGTTA TCAAGCCTGG* J50 

CACCACTGAA GTCAAGTCCG 400 

GTAACCCTGG TGACAACGTT 4 50 

GAAGTCCGCC GCGGTAACGT 500 

GGGCTGCGAC TCCTTCAACG 550 

AAGTCGGTGC TGGTTATGCC 600 

GCTTGCAAGT TCTCCGAGCT 650 

ATCCGTTGAG AACAACCCCA 700 

TCAAGATGGT TCCATCCAAG 750 

751 



2 ) INFORMATION FOR SEQ ID NO: 434 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Cryptococcus albidus 

(B) STRAIN: ATCC 66030 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434 

30 AAGCTCAAGG CCGAGCGAGA GCGAGGTATC ACCATCGACA TCGCCTTGTG 50 

GAAGTTCGAG ACCCCCAAGT ACAATGTCAC CGTCATTGAC GCCCCCGGTC 100 

ACCGAGACTT CATCAAGAAC ATGATCACCG GTACCTCGCA GGCCGACTGT 150 

GCCATCCTCA TCATCGCCTC CGGTATCGGA GAGTTCGAGG CTGGTATCTC 200 

CAAGGACGGT CAGACCCGAG AGCACGCCCT TTTGGCCTTC ACCCTCGGTG 250 

35 TCCGACAGCT CATCATTGCC ATCAACAAGA TGGACACCTG CAAGGTTAGT 300 

TCGCAGGTCC TGGTCTCTGT ACGAATCTTG CTGACCCCTT TTACAGTGGT 350 

CCGAAGACCG ATACAACGAA ATCGTCAAGG AGGCTTCCGG TTTCATCAAG 4 00 

AAGGTCGGAT ACAACCCCAA GACCGTTCCC TTCGTCCCCA TCTCCGGATG 450 

GCACGGAGAC AACATGTTGG AGGAGTCCAC CAACATGCCC TGGTACAAGG 500 

4 0 GATGGCACAA GGAGTCCAAG GCCGGTGTTG TCAAGGGAAA GACCTTGCTC 550 

GAGGCCATCG ACGCCATCGA GCCCCCTACC CGACCTTCCG ACAAGCCCTT 600 

GCGATTGCCC CTCCAGGATG TCTACAAGAT CGGTGGTATC GGTACGGTGC 650 

CCGTCGGTCG AGTCGAGACC GGTGTCATCA AGGCTGGTAT GGTCGTCACC 700 

TTCGCCCCTG CCAACGTCAC CACCGAAGTC AAGTCCGTCG AAATGCACCA 7 50 

45 CGAACAGCTC GCTGAGGGTG TTCCCGGTGA CAACGTCGGT TTCAACGTCA 800 

AGAACGTTTC CGTCAAGGAC ATCCGACGAG GAAACGTCTG CTCCGACTCG 850 

AAGAACGACC CCGCTATGGA GTCTGCTTCC TTCAACGCTC AGGTCATTGT 900 

CTTGAACCAC CCGGGTCAGA TCGGTGCCGG CTACTCCCCC GTTTTGGACT 950 

GCCACACCGC TCACATTGCC TGCAAGTTCG CTGAGCTCGT TGAGAAGATC 1000 

50 GACCGACGAA CCGGTAAGGT CATGGAGGCC GCCCCCAAGT TCGTCAAGTC 1050 

CGGTGACGCC GCCATCGTCA AGTTGATCCC GTCCAAGCCC ATGTGTGTCG 1100 

AGTCCTACTC CGAGTACCCC CCCTTGGGTC GATTCGCCGT CCGAGA 114 6 



55 

2) INFORMATION FOR SEQ ID NO: 435 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1095 bases 
60. (B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Exophiala jeanselmei 

(B) STRAIN: ATCC 64755 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435 

CAAGCTGAAG GCCGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCCTTGT 50 

GGAAGTTCGA GACTCCCAAG TACTATGTCA CTGTCATCGA CGCCCCTGGT 100 

CATCGTGACT TTATCAAGAA CATGATCACT GGTACTTCCC AGGCTGACTG 150 

15 CGCCATTCTC ATCATTGCCG CCGGTACTGG TGAATTCGAA GCCGGTATCT 2 00 

CCAAGGATGG TCAGACTCGT GAGCACGCTC TGCTCGCCTA CACCCTGGGT 250 

GTCAAGCAGC TCATTGTCGC CATCAACAAG ATGGACACCA CCAAGTGGTC 300 

CGAGGATCGT TTCAACGAAA TCATCAAGGA GACTTCCAGC TTCATCAAGA 3 50 

AGGTCGGCTA CAACCCCAAG TCCGTTCCTT TCGTCCCCAT CTCCGGCTTC 4 00 

20 AACGGTGACA ACATGATCGA TGTCTCCACC AACTGCCCCT GGTACAAGGG 450 

CTGGGAGAAG GAGACCAAGG CTGGCAAGGC CTCTGGCAAG ACTCTCCTCG 500 

AGGCCATCGA CGCCATCGAC CCCCCCACTC GTCCCACCGA CAAGCCTCTC 550 

CGTCTTCCTC TCCAGGATGT CTACAAGATC TCTGGTATCG GAACGGTGCC 600 

CGTCGGTCGT GTTGAGACTG GTGTCATCAA GGCCGGTATG GTCGTTACCT 650 

25 TCGCTCCTGC CAACGTCACC ACTGAAGTCA AGTCCGTCGA AATGCACCAC 700 

GAACAACTCG CCGAGGGTGT TCCAGGTGAC AACGTTGGTT TCAACGTCAA 750 

GAACGTCTCC GTCAAGGAGG TTCGTCGTGG AAACGTCTGC GGTGACTCCA 8 00 

AGAACGACCC ACCCAAGGGT GCTGATTCCT TCAACGCCCA GGTCATCGTC 850 

TTGAACCACC CTGGTCAAGT CGGTGCTGGC TACGCCCCAG TGTTGGATTG 900 

3 0 CCACACTGCC CACATTGCTT GCAAGTTCTC TGAGCTTCTC GAGAAGATTG 950 

ACCGCCGTAC CGGTAAATCC ATCGAAAACA ACCCCAAGTT CATCAAGTCT 1000 

GGTGACGCTG CCATCGTCAA GATGGTTCCC AGCAAGCCCA TGTGTGTTGA 1050 

GGCCTTCACT GACTACCCAC CTCTTGGTCG TTTCGCCGTC CGTGA 1095 



35 



2) INFORMATION FOR SEQ ID NO: 436 

(i) SEQUENCE CHARACTERISTICS : 
40 (A) LENGTH: 1113 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 



50 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium oxysporum 

(B) STRAIN: WSA-212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436 



AAGCTCAAGG CCGAGCGTGA GCGTGGTATC ACCATCGATA TTGCTCTCTG 50 

GAAGTTCGAG ACTCCTCGCT ACTATGTCAC CGTCATTGGT ATGTTGTCGC 100 

55 TCATGCTTCA TTCTACTTCT CTTCGTACTA ACACATCACT CAGACGCTCC 150 

CGGTCACCGT GATTTCATCA AGAACATGAT CACTGGTACT TCCCAGGCCG 200 

ATTGCGCCAT TCTCATCATT GCCGCCGGTA CTGGTGAGTT CGAGGCTGGT 250 

ATCTCCAAGG ATGGCCAGAC CCGTGAGCAC GCTCTTCTTG CCTACACCCT 3 00 

TGGTGTCAAG AACCTCATCG TCGCCATCAA CAAGATGGAC ACCACCAAGT 350 

60 GGTCTGAGGC CCGTTACCAG GAGATCATCA AGGAGACCTC CTCTTTCATC 4 00 

264 
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AAGAAGGTCG GCTACAACCC CAAGGCTGTC GCTTTCGTCC CCATCTCCGG 450 

TTTCAACGGT GACAACATGC TTACCCCCTC CACCAACTGC CCCTGGTACA 500 

AGGGTTGGGA GCGTGAGATC AAGTCCGGCA AGCTCACTGG CAAGACCCTC 550 

CTCGAGGCCA TTGACTCCAT CGAGCCCCCC AAGCGTCCCG TTGACAAGCC 600 

5 CCTTCGTCTT CCCCTTCAGG ATGTCTACAA GATCGGTGGT ATTGGAACGG 650 

TTCCCGTCGG CCGTATCGAG ACTGGTGTCA TCAAGCCCGG TATGGTCGTT 700 

ACCTTCGCTC CTTCCAACGT CACCACTGAA GTCAAGTCCG TCGAGATGCA 750 

CCACGAGCAA CTCACTGAGG GCCAGCCCGG TGACAACGTT GGTTTCAACG 800 

TGAAGAACGT CTCCGTCAAG GACATCCGAC GTGGTAACGT CGCTGGTGAC 850 

10 TCCAAGAACG ACCCCCCTAT GGGTGCCGCT TCTTTCACCG CCCAGGTCAT 900 

CGTCCTCAAC CACCCCGGTC AGGTCGGTGC TGGTTACGCT CCCGTCCTCG 950 

ATTGTCACAC TGCCCACATT GCCTGCAAGT TCGCCGAGAT CCAGGAGAAG 1000 

ATCGACCGCC GAACCGGTAA GGCTACTGAG GCCGCCCCCA AGTTCATCAA 1050 

GTCTGGTGAC TCCGCCATCG TCAAGATGGT TCCCTCCAAG CCCATGTGTG 1100 

15 TTGAGGCTTT CAC 1113 



2) INFORMATION FOR SEQ ID NO: 437 



20 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 726 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
3 0 (A) ORGANISM: Geotrichum spp. 

(B) STRAIN: LEV-4 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437 

35 GGTCCGAGGA CAGATTCAAC GAGATTGTCA AGGAGACTTC CAACTTCATC 50 

AAGAAGGTTG GTTAYAACCC CAAGACTGTT GCTTTCGTCC CCATCTCTGG 100 

TTGGAACGGT GACAACATGA TTGAGCCCTC CACCAACTGC CCCTGGTACA 150 

AGGGATGGCA GAAGGAGACC AAGGCTGGTG TCACTAAGGG TAAGACCCTC 2 00 

CTTGAGGCCA TCGATGCCAT TGAGCCCCCT GTCAGACCTT CCGACAAGCC 250 

40 CCTCCGTCTT CCCCTCCAGG ATGTCTACAA GATCGGTGGT ATCGGAACTG 3 00 

TGCCCGTCGG CCGTGTCGAA ACCGGTGTCA TCAAGGCCGG TATGGTCGTC 350 

ACCTTCGCCC CCGCTGGTGT CACCACTGAA GTCAAGTCCG TCGAGATGCA 4 00 

CCACGAGCTC CTCACTGAGG GTCTCCCCGG TGACAACGTT GGTTTCAACG 450 

TCAAGAACGT CTCCGTTAAG GATATCAGAC GTGGTAACGT CTGCGGTGAC 500 

45 TCCAAGAACG ATCCCCCCAA GGCTTGCGCT TCTTTCAACG CCCAGGTCAT 550 

TATCTTCAAC CACCCTGGTC AGATCTCTGC TGGATACTCT CCCGTCCTTG 600 

ATTGCCACAC CGCCCATATT GCTTGCAAGT TCGACACTTT GATCGAGAAG 650 

ATTGACCGTC GTACTGGTAA GAAGACTGAG GACTCCCCCA AGTTCGTCAA 700 

GGCCGGTGAT GCTGCTATCG TCAAGA 726 

50 



2) INFORMATION FOR SEQ ID NO: 438 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 754 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 

265 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Histoplasma capsulatum 
5 (B) STRAIN: G186A5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438 

CACCACCAAG TGGTCCGAGT CCCGTTTCAA CGAAATCATC AAGGAGGTTT 50 

10 CCAACTTCAT CAAGAAGGTC GGATATAACC CCAAGGCTGT TCCCTTCGTG 100 

CCAATCTCTG GTTTCGAGGG TGACAACATG ATTGAACCCT CCCCCAACTG 150 

CACATGGTAC AAGGGCTGGA ACAAGGAGAC TGCCTCTGGC AAGTCTTCTG 2 00 

GTAAAACCCT TCTCGATGCC ATTGACGCCA TTGAACCCCC AACCCGTCCT 250 

ACCGATAAGC CCCTCCGTCT TCCCCTCCAG GATGTGTACA AAATCTCTGG 300 

15 TATTGGCACT GTTCCCGTCG GACGTGTTGA GACTGGTGTC ATCAAGCCCG 350 

GTATGGTCGT GACTTTCGCT CCCTCCAACG TCACCACTGA AGTCAAGTCC 400 

GTCGAAATGC ACCACCAACA ACTCCAGGCT GGTTACCCTG GCGACAACGT 450 

CGGCTTCAAC GTCAAGAACG TTTCAGTCAA GGAAGTCCGC CGTGGCAACG 500 

TTGCTGGCGA CTCCAAAAAT GATCCCCCTA AGGGCTGCGA ATCCTTCAAT 550 

20 GCCCAGGTCA TCGTCCTTAA CCACCCCGGC CAGGTTGGCG CTGGTTATGC 600 

CCCAGTCCTC GACTGCCACA CTGCCCACAT TGCTTGCAAG TTCTCTGAGC 650 

TTATTGAGAA GATCGACCGC CGTACCGGAA AGTCTGTTGA GAACAACCCC 700 

AAGTTCATCA AGTCTGGTGA TGCTGCTATC GTCAAGATGG TTCCCTCCAA 750 

GCCC 754 

25 



2) INFORMATION FOR SEQ ID NO : 43 9 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) ' LENGTH: 743 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Issatchenkia orientalis 
40 (B) STRAIN: ATCC 6258 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439 

TGGGATGAAA ACAGATTTGA AGAAATTGTC AAGGAAACCC AAAACTTCAT 50 

45 CAAGAAGGTT GGTTACAACC CAAAGACTGT TCCATTCGTT CCAATCTCTG 100 

GTTGGAATGG TGACAACATG ATTGAAGCAT CCACCAACTG TC CATGGT AC 150 

AAGGGTTGGA CTAAGGAAAC CAAGGCAGGT GTTGTTAAGG GTAAGACCTT 2 00 

ATTAGAAGCA ATCGATGCTA TTGAACCACC TGTCAGACCA ACCGAAAAGC 2 50 

CATTAAGATT ACCATTACAA GATGTTTACA AGATTGGTGG TATTGGTACT 3 00 

50 GTGCCAGTCG GTAGAGTCGA AACCGGTGTC ATTAAGCCAG GTATGGTTGT 3 50 

CACTTTTGCT CCAGCAGGTG TCACCACCGA AGTCAARTCC GTTGAAATGC 400 

ACCATGAACA ATTAGAACAA GGTGTTCCAG GTGATAACGT TGGTTTCAAC 4 50 

GTTAAGAACG TCTCTGTCAA GGATATCAAG AGAGGTAACG TTTGTGGTGA 500 

CTCCAAGAAC GACCCACCAA TGGGTGCAGC TTCYTTCAAT GCTCAAGTCA 550 

55 TTGTCTTGAA CCACCCTGGT CAAATTTCCG CTGGTTACTC TCCAGTCTTG 60 0 

GATTGTCACA CTGCCCACAT TGCATGTAAG TTCGACGAAT TAATCGAAAA 650 

GATTGACAGA AGAACTGGTA AGTCTGTTGA AGACCATCCA AAGTCYGTCA 700 

AGTCTGGTGA TGCAGCTATC GTCAAGATGG TCCCAACCAA GCC 743 

60 

266 
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2) INFORMATION FOR SEQ ID NO: 440 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1091 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malassezia furfur 

(B) STRAIN: ATCC 42132 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440 

CAAGCTCAAG GCTGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCTCTGT 50 

GGAAGTTCGA GACCCCTAAG TACCACGTTA CCGTCATTGA CGCTCCTGGT 100 

CACCGTGACT TCATCAAGAA CATGATTACG GGTACCTCGC AGGCTGACTG 150 

CGCTATCCTC ATCATTGCCG GTGGTACCGG TGAGTTCGAG GCTGGTATCT 2 00 

CGAAGGACGG TCAGACCCGT GAGCACGCTC TGCTCGCTTT CACCCTGGGT 2 50 

GTGCGTCAGC TCATTGTGGC CGTCAACAAG ATGGACACCA CCAAGTACTC 3 00 

GGAGGACCGC TTCAACGAGA TTGTCCGCGA AGTGTCGAAC TTCATCAAGA 3 50 

AGGTCGGTTT CAACCCCAAG ACTGTTGCCT TCGTCCCCAT CTCGGGCTGG 400 

CACGGTGACA ACATGATCGA GGCCACCACC AACATGCCTT GGTACAAGGG 450 

CTGGGAGAAG GAGACCAAGT CGGGCAAGGT CACTGGTAAG ACTCTGCTGG 500 

ACGCCATCGA CGCCATCGAG CCCCCGACCC GCCCCACTGA CAAGCCCCTG 550 

CGTCTCCCTC TGCAGGATGT GTACAAGATC GGTGGTATCG GTACTGTCCC 600 

TGTCGGTCGT GTTGAGACCG GTGTGATCAA GCCCGGTATG GTTGTGACCT 650 

TCGCTCCCTC GAACGTCACC ACTGAAGTTA AGTCGGTTGA GATGCACCAC 700 

GAGTCGCTCC CTGAGGGTCT CCCCGGTGAC AACGTTGGTT TCAACGTGAA 750 

GAACGTCTCG GTTAAGGACA TTCGCCGTGG TAACGTTGCC TCGGACTCGA 800 

AGAACGACCC CGCTCAGGAG GCTGCTTCGT TCAACGCGCA GGTCATTGTC 850 

ATGAACCACC CTGGTCAGAT CAGCAACGGT TACTCGCCCG TGCTTGACTG 900 

CCACACTGCG CACATTGCCT GCCGCTTCAA CAACATCCTC CAGAAGATCG 950 

ACCGTCGCTC GGGTAAGGTG CTTGAGGAGA ACCCCAAGTT CATCAAGTCG 10 00 

GGTGACGCTG CCATGGTGGA GATGATCCCC ACCAAGCCCA TGTGTGTGGA 1050 

GTCGTTCAAC GAGTACCCCC CTCTGGGTCG TTTCGCTGTG C 1091 



2 ) INFORMATION FOR SEQ ID NO: 441 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 749 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malassezia pachydermatis 

(B) STRAIN: ATCC 42756 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441 

ACCACCAAGT ACTCGGAGGA CCGCTTCAAC GAGATTATTC GCGAGACTTC 50 
GAACTTCATC AAGAAGGTCG GTTACAACCC GAAGACTGTT GCCTTCGTCC 100 

267 
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CGATCTCGGG CTGGCACGGT GACAACATGA TTGAGGCGAC CXCCAACATG 150 

CCGTGGTACA AGGGCTGGGA GAAGGAGACC AAGTCGGGCA AGGCCACTGG 2 00 

TAAGACCCTT CTGGACGCTA TTGACGCCAT TGAGCCGCCG ACGCGCCCGA 250 

CCGACAAGCC TCTCCGTCTT CCTCTCCAGG ATGTGTACAA GATCGGTGGT 3 00 

5 ATCGGTACYG TCCCGGTCGG CCGTGTTGAG ACCGGTGTTA TCAAGCCCGG 350 

TATGGTTGTG ACCTTCGCTC CGTCGAACGT CACSACTGAA GTTAAGTCGG 400 

TCGAGATGCA CCACGAGCAG ATCCCTGAGG GTCTTCCGGG TGACAACGTT 450 

GGTTTCAACG TGAAGAACGT GTCGGTCAAG GACATTCGCC GTGGTAACGT 500 

CGCCTCGGAC TCGAAGAACG ACCCGGCTCA GGAGGCTGCC TCGTTCAATG 550 

10 CTCAGGTCAT TGTGATGAAC CACCCTGGTC AGATCAGCAA CGGTTACTCG 600 

CCRGTGCTCG ACTGCCACAC TGCTCACATT GCCTGCCGCT TCAACAACAT 650 

CCTCCAGAAG ATCGACCGTC GTTCGGGTAA GGTTCTYGAA GAGAACCCCA 700 

AGTTCATCAA GTCGGGTGAC GCTGCCATGG TTGAGATGAT CCCGACCAA 749 



15 



30 



2) INFORMATION FOR SEQ ID NO: 442 



( i ) SEQUENCE CHARACTERISTICS : 
2 0 (A) LENGTH: 1150 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE : Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malbranchea filamentosa 

(B) STRAIN: ATCC 48174 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 442 



ACTGAAGGCC GAGCGTGAGC GTGGTATCAC CATCGATATC GCCCTCTGGA , 50 

AGTTCGAGAC CCCCAAGTAC CATGTCACCG TCATTGGTAC GTTCGACATG 100 

35 TTCGACCTTT TGC CTAGTGT CCCCTTCTAA CCACAGTTTA TAGACGCCCC 150 

TGGCCATCGT GATTTCGTCA AGAACATGAT CACTGGTACT TCCCAGGCTG 200 

ATTGCGCTAT CCTCATCATT GCTTCCGGCA CTGGTGAATT CGAGGCTGGT 250 

ATCTCCAAGG ATGGCCAGAC CCGTGAGCAC GCTCTGCTTT CCTTCACCCT 3 00 

CGGTGTTAGG CAGCTCATTG TCGCCCTCAA CAAGATGGAC ACTGTCAACT 350 

40 TCGCTGAGGC CCGTTACAAC GAGATTGTCA AGGAAGTCTC CAACTTCATC 400 

AAGAAGGTCG GCTACAACCC CAAGGCCGTT CCTTTCGTCC CCATCTCCGG 450 

TTTCGAGGGT GACAACATGA TCGAGGCCTC CACCCGCATT CCTTGGTACA 500 

AGGGCTGGAA CAAGGAGACC GCCAGTGGCA AGAGCACCGG CAAGACCCTY 550 

CTCGAGGCCA TTGATGCCAT CGAACCCCCG GTCCGTCCCA CCGACAAGCC 600 

4 5 CCTYCGTCTC CCTCTTCAGG ATGTGTACAA GATCTCCGGT ATTGGCACTG 650 

TTCCTGTCGG TCGTGTTGAG ACTGGTGTCA TCAAGCCTGG TATGGTCGTT 700 

ACTTTCGCCC CCGCCAACGT CACCACTGAA GTCAAGTCCG TCGAGATGCA 750 

CCACCAGCAG CTCCAGGCCG GTAACCCCGG TGACAACGTC GGCTTCAACG 800 

TCAAGAACGT TTCCGTCAAG GAAGTCCGCC GTGGCAACGT TGCCTCCGAC 850 

50 TCCAAGAACG ACCCCGCCAA GGGCTGCGAC TCCTTCAACG CCCAGGTCAT 900 

CGTCCTTAAC CACCCCGGTC AGGTCGGTGC TGGATACGCT CCCGTCCTCG 950 

ATTGCCACAC TGCCCACATT GCTTGCAAGT TCTCTGAGCT TCTTGAGAAG 1000 

ATCGATCGCC GTACCGGTAA ATCCGTTGAG GACCACCCCA AGTTCATCAA 1050 

GTCTGGTGAT GCCGCTATCG TCAAGATGGT TCCCTCCAAG CCTATGTGCG 1100 

55 TTGAGGCTTT CACTGACTAC CCTCCCCTTG GTCGTTTCGC CGTCCGTGAC 1150 



60 



2) INFORMATION FOR SEQ ID NO: 443 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1099 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

i 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Metschnikowia pulcherrima 

(B) STRAIN: DSM 70336 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443 

GGACAAGTTG AAGGCTGAGA GAGAGAGAGG TATCACCATC GACATTGCCT 50 

TGTGGAAGTT CGAGACTCCT AAGTACCACG TCACCGTYAT TGACGCCCCA 100 

GGTCACAGAG ATTTCATCAA GAACATGATC ACTGGTACTT CCCAGGCTGA 150 

CTGTGCYATC TTGATTATCG CYGGTGGTGT TGGTGAGTTC GAGGCTGGTA 2 00 

TCTCCAAGGA TGGCCAGACC AGAGAGCACG CTTTGTTGGC YTACACCTTG 2 50 

GGTGTTAGAC ARTTGATTGT TGCCGTCAAC AAGATGGACT CTGTCAAGTG 3 00 

GGACAAGAAC AGATTTGAGG AGATCATCAA GGAGACCTCT AACTTCGTCA 350 

AGAAGGTTGG TTACAACCCT AAGACYGTGC CATTCGTGCC AATYTCTGGT 400 

TGGAACGGTG ACAACATGAT TGAGGCYTCC ACTAACTGCC CATGGTACAA 450 

GGGTTGGGAG AAGGAGACCA AGGCCGGTAA GTCTWCCGGT AAGACCTTGT 500 

TGGAGGCCAT TGACGCCATT GAGCCACCAA CCAGACCTAC CGACAAGGCC 550 

TTGAGATTGC CTTTGCAGGA TGTCTACAAG ATCGGTGGTA TCGGAACGGT 600 

GCCAGTCGGC CGTGTCGAGA CCGGTGTCAT TAAGGCCGGT ATGGTTGTYA 650 

CCTTYGCCCC AGCTGGTGTC ACCACTGAGG TCAAGTCCGT CGAGATGCAC 700 

CACGAGCAGT TGGTCGAGGG TCTTCCAGGT GACAAYGTTG GTTTCAACGT 750 

CAAGAACGTC TCCGTTAAGG AGATCAGAAG AGGTAACGTC TGTGGTGACT 800 

CCAAGCAGGA CCCACCAAAG GGTGCCGCTT CTTTCACCGC YCAGGTTATT 850 

GTGTTGAACC ACCCTGGTCA GATCTCCTCT GGTTACTCTC CAGTGTTGGA 900 

CTGYCACACC GCCCACATTG CCTGTAARTT CGACACCTTG TTGGAGAAGA 950 

TTGACAGAAG AACTGGTAAG TCCTTGGAGT CYGAGCCTAA GTTCGTCAAG 1000 

TCYGGTGACG CCGCCATTGT CAAGATGGTG CCAACCAAGC CAATGTGTGT 1050 

TGAGGCTTTC ACCGACTACC CACCTTTGGG TAGATTCGCC GTCAGAGAC 1099 



2) INFORMATION FOR SEQ ID NO: 444 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1153 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paecilomyces lilacinus 

(B) STRAIN: ATCC 42570 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444 

CAAGCTCAAG GCCGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCCCTCT 50 

GGAAGTTCGA GACTCCCAAG TACTATGTCA CCGTCATTGG TACGTCGACT 100 

CGCGCGAGAC TGGTCGCAAT TTCCACGTCG CTAACGTGCT TGAACAGACG 150 

CTCCCGGCCA CCGTGACTTC ATCAAGAACA TGATCACTGG TACCTCCCAG 200 

GCTGACTGCG CTATCCTCAT TATCGCTGCC GGCACTGGTG AGTTCGAGGC 250 
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TGGTATCTCC AAGGATGGCC 
CCCTCGGTGT TAAGCAGCTC 
AAGTGGTCTG AGGCCCGTTT 
CATCAAGAAG GTCGGCTACA 
5 CTGGTTTCCA CGGCGACAAC 
TACAAGGGCT GGGAGAAGGA 
CCTCCTTGAG GCCATCGACT 
AGCCCCTCCG CCTTCCCCTT 
ACAGTCCCTG TCGGCCGTAT 

10 CGTGACCTTC GCTCCTTCCA 
TGCACCACGA GCAGCTCTCC 
AACGTCAAGA ACGTCTCCGT 
TGACTCCAAG AACGACCCCC 
TCATCGTCCT CAACCACCCC 

15 CTCGACTGCC ACACCGCCCA 
GAAGATCGAC CGCCGTACCG 
TCAAGTCTGG CGACTCTGCC 
TGCGTTGAGG CTTTCACCGA 
TGA 

20 



AGACCCGTGA GCACGCTCTG 
ATCGTCGCTA TCAACAAGAT 
CCAGGAGATC ATCAAGGAGA 
ACCCCAAGAC CGTCGCTTTC 
ATGCTTTCCC CCTCCACCAA 
GACCAAGGCT GGCAAGTCCA 
CCATCGAGCC CCCCAAGCGC 
CAGGATGTGT ACAAGATCGG 
CGAGACTGGT GTCATCAAGC 
ACGTCACCAC CGAAGTCAAG 
GAGGGTGTCC CCGGTGACAA 
CAAGGAGATC CGTCGTGGCA 
CTCTGGGTGC CGCTTCTTTC 
GGCCAGGTCG GTGCTGGCTA 
CATTGCCTGC AAGTTCGCCG 
GCAAGTCTGT CGAGTCCGCC 
ATCGTCAAGA TGATTCCCTC 
CTACCCTCCT CTGGGCCGCT 



PCT/CAOO/01150 



CTCGCCT2R3T 


3CKT 


GGACACCACC 


350 


CCTCCAACTT 


400 


GTCCCCATCT 


450 


CTGCCCCTGG 


500 


CCGGCAAGAC 


550 


CCCAGCGACA 


600 


CGGTATCGGC 


650 


CCGGCATGGT 


700 


TCCGTTGAGA 


750 


CGTCGGCTTC 


800 


ACGTCGCCGG 


850 


GATGCCCAGG 


900 


CGCCCCCGTC 


950 


AGATCAAGGA 


1000 


CCCAAGTTCA 


1050 


CAAGCCCATG 


1100 


TCGCCGTCCG 


1150 




1153 



2) INFORMATION FOR SEQ ID NO: 445 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 763 bases 

(B) . TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 
35 (B) STRAIN: ATCC 32071 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445 



TACCACTAAG TGGTCCGAGA CCCGATTCAA TGAAATTATC AAGGAAGTCA 50 

40 CCAATTTCAT TAAGAAGGTC GGATATAACC CCAAGACTGT TCCTTTCGTT 100 

CCCATTTCTG GTTTCGAGGG TGACAACATG ATCGAGCCCT CTGCCAACTG 150 

CCCATGGTAC AAGGGCTGGT CCAAGGAGAC TGCTCAGGGC AAGTACTCTG 200 

GCAAGACCCT TCTTGAGGCC ATCGACGCCA TTGAGCCCCC CACCCGTCCT 250 

ACCGATAAAC CTCTCCGTCT TCCCCTCCAG GATGTCTACA AGATCTCCGG 300 

45 TATTGGCACT GTTCCTGTCG GACGTGTTGA GACTGGAGTC ATCAAGCCCG 350 

GTATGGTCGT GACCTTCGCT CCCGCCAACG TCACCACTGA AGTCAAGTCC 4 00 

GTCGAAATGC ACCACCAGCA GCTTACCGCC GGTAACCCCG GTGACAACGT 450 

CGGCTTCAAC GTCAAGAATG TTTCCGTCAA AGAAGTCCGC CGTGGTAACG 500 

TTGCCGGTGA CTCTAAGAAT GATCCCCCAA AGGGCTGCGA TTCCTTCAAT 550 

50 GCCCAGGTCA TCGTCCTCAA CCACCCTGGT CAGGTTGGCG CTGGTTATGC 600 

CCCAGTCCTC GACTGCCATA CTGCCCACAT TGCCTGCAAA TTCGCTGAGC 650 

TCCTTGAGAA GATTGATCGA CGAACCGGAA AGTCTGTTGA GAACAACCCC 700 

AAGTTCATCA AGTCCGGTGA TGCTGCTATC GTCAAGATGA TTCCTTCCAA 750 

GCCCATGTGC GTC 763 

55 



2) INFORMATION FOR SEQ ID NO: 446 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1346 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium marneffei 
10 (B) STRAIN: ATCC 64101 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 446 

AAGCTCAAGG CTGAGCGTGA GCGTGGTATC ACCATCGATA TTGCTCTCTG 50 

15 GAAGTTCCAG ACTGCCAAGT ACGAGGTTAC CGTCATTGAC GCCCCCGGTC 100 

ACCGTGATTT CATCAAGAAC ATGATCACTG GTACCTCCCA GGCCGATTGC 150 

GCTATTCTCA TCATTGCCTC TGGTACTGGT GAATTCGAGG CTGGTATCTC 200 

CAAGGATGGC CAGACTCGTG AGCACGCTCT TTTGGCTTTC ACCCTCGGTG 250 

TCCGTCAGCT CATTGTTGCC CTCAACAAGA TGGACACTTG CAAGTGGTCT 300 

20 CAGGGTGAGT ACTCGTACCT GCGTTTGGCC TTGAATATCT TACTAATGCA 3 50 

CCATAGATCG TTACAACGAA ATTGTCAAGG AGACTTCCAA CTTCATCAAG 400 

AAGGTCGGAT ACAACCCCAA GAACGTTCCT TTCGTTCCTA TCTCCGGTTT 450 

CAACGGTGAC AACATGCTTG AGCCCTCCCC CAACTGCCCC TGGTACAAGG 500 

GTTGGGAGAA GGAGACCAAG GCCGGTAAGG TCACTGGTAA GACCCTCCTC 550 

25 GAGGCCATCG ACGCCATTGA GCCCCCTACC CGTCCCGCCA ACAAGGTTAG 600 

TCCCTCCTCG ACTACTCAAA CCCTCCTCAT AAGTTCAGAT TACTGACTCG 650 

TTCACAGCCC CTCCGTCTTC CCCTCCAGRA TGTCTACAAG ATCGGTGGTA 7 00 

TTGGAACGGT TCCCGTCGGT CGTGTTGAGA CTGGTACCAT CGTTCCTGGT 750 

ATGGTTGTCA CCTTGTAAGT CACTCTCCTC GCTTATCCTA CCTGAAATCA 800 

30 TCATGTGCTA ACTTGACACT CAGCGCTCCC GCCAACGTCA CCACTGAAGT 850 

CAAGAGTGTT GAAATGCACC ACCAGCAGCT CACTGCCGGT CAGCCCGGTG 900 

ACAACGTTGG TTTCAACGTG AAGAACGTCT CCGTCAAGGA AATCCGTCGT 950 

GGTAACGTTG CTGGTGACAG CAAGAACGAC CCCCCTGCCG GTGCTGCCTC 1000 

CTTCAACGCC CAGGTCATCG TCCTCAACCA CCCCGGTCAG GTCGGTGCTG 1050 

3 5 GTTACGCCCC AGTCCTCGAT TGCCACACTG CCCACATTGC TTGCAAGTTC 1100 

GCTGAGCTCC TCGAGAAGAT TGACCGTCGT ACCGGAAAGT CTGTTGAGGA 1150 

CCACCCCAAG TTCATCAAGT CCGGTGACGC TGCCATCGTC AAGATGATTC 12 00 

CTTCCAAGCC CATGTGTGTT GAGGCTTTCA CCGAGTACCC TCCTCTCGGT 1250 

CGTTTCGCCG TTCGCGAGTA AGTTTTATCT CCGTTGTCTA TTTTCCATCC 13 00 

40 TTCCCTTCTC CTCCGTCTTC CATATATACT TTTTCAGTTA TATGTG 1346 



2) INFORMATION FOR SEQ ID NO: 447 



45 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1094 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 18205 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 447 

60 AAGTTAAAAG CTGAACGTGA AAGAGGTATC ACTATTGATA TTGCTTTATG 50 
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GAAATTCGAA ACTCCAAAAT AC CATGTTAC CGTTATTGAT GCTCCAGGTC 100 
ACAGAGATTT CATCAAAAAT ATGATTACTG GTACTTCCCA AGCTGATTGT 150 
GCTATYTTAA TTATTGCCGG TGGTATTGGT GAATTCGAAG CTGGTATCTC 200 
AAAAGATGGT CAAACCAGAG AACACGCTTT NTTAGCTTAC ACCTTAGGTG 250 
5 TTAAACAATT GATTGTTGCT ATCAACAAGA TGGATTCCGT TAAATGGGAT 3 00 

GAATCTCGTT TCGAAGAAAT TGTCAAGGAA ACYTCAAACT TTATCAAGAA 350 
AGTTGGTTAC AACCCAAAAA CTGTTCCATT CGTTCCAATC TCAGGTTGGA 400 
ATGGTGATAA CATGATTGAA CCATCAWCTA ACTGTCCATG GTACAAAGGT 450 
TGGAAAAAAG AAACCAAAGC TGGTGAAGCT AAAGGTAAAA CTTTATTAGA 500 

10 AGCCATTGAT GCTATTGACC CACCATCAAG ACCAACTGAT AAACCATTAC 550 
GTTTACCATT ACAAGATGTT TACAARATTG GTGGTATTGG TACTGTGCCA 600 
GTCGGTAGAG TTGAAACCGG TGTTATCAAA CCAGGTATGG TTGTTACCTT 650 
TGCCCCAGCT GGTGTTACCA CTGAAGTCAA ATCTGTTGAA ATGCATCATG 700 
AACAATTGAC TGAAGGTTTA CCAGGTGACA ATGTTGGTTT CAACGTTAAG 750 

15 AATGTTTCTG TTAAAGAAAT CCGTCGTGGT AACGTCTGTG GTGACTCTAA 800 
AAACGATCCA CCAAAAGCTG CTGAATCATT CAATGCTCAA GTTATTGTCT 850 
TAAACCATCC AGGTCAAATC TCTGCTGGTT ACTCTCCAGT TTTAGATTGT 900 
CACACTGCTC ACATTGCTTG TAAATTTGAC ACTTTAATTG AAAAAATTGA 950 

CAGACGTACT GGTAAGAAAT TAGAAGAAGC TCCAAAATTC ATCAAATCAG 1000 

2 0 GTGATGCTGC TATTGTTAAA TTTGTTCCAT CAAAACCATT ATCAGTTGAA 1050 

GCTTTCACTG ACTACCCACC ATTAGGTCGT TTCGCTGTCA GAGA 1094 



25 2) INFORMATION FOR SEQ ID NO: 448 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1100 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 2149 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448 



CTTAGATAAG TTAAAAGCTG AACGTGAAAG AGGTATCACT ATTGATATTG 50 

CTTTATGGAA ATTCGAAACT CCAAAATACC ATGTTACCGT TATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAAAATATG ATTACTGGTA CTTCCCAAGC 150 

TGATTGTGCT ATMTTAATTA TTGCCGGTGG TATTGGTGAA TTCGAAGCTG 200 

45 GTATCTCAAA AGATGGTCAA ACCAGAGAAC ACGCTTTRTT AGCTTACACC 250 

TTAGGTGTTA AACAATTGAT TGTTGCTATC AACAAGATGG ATTCCGTTAA 300 

ATGGGATGAA TCTCGTTTCG AAGAAATTGT CAAGGAAACY TCAAACTTTA 350 

TCAAGAAAGT TGGTACAACC CAAAAACTGT TCCATTCGTT CCAATCTCAG 400 

GTTGGAATGG TGATAACATG ATTGAACCAT CAACTAACTG TCCATGGTAC 450 

50 AAAGGTTGGA AAAAAGAAAC CAAAGCTGGT GAAGCTAAAG GTAAAACTTT 500 

ATTAGAAGCC ATTGATGCTA TTGATCCACC ATCAAGACCA ACTGATAAAC 550 

CATTACGTTT ACCATTACAA GATGTTTACA ARATTGGTGG TATTGGTACT 600 

GTGCCAGTCG GTAGAGTTGA AACCGGTGTT ATCAAAC CAG GTATGGTTGT 650 

TACCTTTGCC CCAGCTGGTG TTACCACTGA AGTCAAATCT GTTGAAATGC 700 

55 ATCATGAACA ATTGACTGAA GGTTTACCAG GTGACAATGT TGGTTTCAAC 750 

GTTAAGAATG TTTCTGTTAA AGAAATCCGT CGTGGTAACG TCTGTGGTGA 800 

CTCTAAAAAC GATCCACCAA AAGCTGCTGA ATCATTCAAT GCTCAAGTTA 850 

TTGTCTTAAA CCATCCAGGT CAAATCTCTG CTGGTTACTC TCCAGTTTTA 900 

GATTGTCACA CTGCTCACAT TGCTTGTAAA TTTGACACTT TAATTGAAAA 950 

60 AATTGACAGA CGTACTGGTA AGAAATTAGA AGAAGCTCCA AAATTCATCA 1000 
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WO 01/23604 PCT/CA00/01 150 

AATCAGGTGA TGCTGCTATT GTTAAATTTG TTCCATCAAA ACCATTATCA ' 1050 
GTTGAAGCTT TCACTGACTA CCCACCATTA GGTCGTTTCG CTGTCAGAGA 1100 



2) INFORMATION FOR SEQ ID NO: 449 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1201 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudallescheria boydii 

(B) STRAIN: ATCC 44331 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449 

CAAGCTCAAG GCCGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCCCTCT 50 

GGAAGTTCGA GACCCCCAAG TACCAGGTCA CCGTCATTGG TATGTCTTTG 100 

TGCTTTTTGT GCTTTTTGTG CCTGTGCCTC GCACAATTCC AGCCCTCGAT 150 

25 AATTATGAAC CTCGTACTAA TATGTCGTTC TCCCACTACC CACAGACGCC 2 00 

CCCGGCCATC GTGATTTCAT CAAGAACATG ATTACTGGTA CCTCCCAGGC 2 50 

TGATTGCGCC ATTCTCATCA TTGCCTCCGG TACTGGTGAG TTCGAGGCTG 3 00 

GCATCTCCAA GGATGG CCAG ACCCGTGAGC ACGCTCTTCT CGCCTTCACC 3 50 

CTCGGTGTCA AGAACCTCAT TGTTGCCATC AACAAGATGG ACACCAACAA 4 00 

30 CTGGTCCGAG GACCGATACA AGGAGATCAT CAAGGAGACC TCCAACTTCA 450 

TCAAGAAGGT CGGCTACAAC CCCAAGGCCG TTCCTTTCGT CCCCATCTCC 500 

GGTTTCCACG GAGACAACAT GCTTACCCCC TCCACCAACT GCCCCTGGTA 550 

CAAGGGTTGG GTCCGTGAGG TCAAGGGTAA CACCCTTACC GGCAAGACCC 600 

TTCTCGAGGC CATCGACTYC ATCGAGCCCC CCAAGCGTCC CACCGAGAAG 650 

35 CCCCTCCGTC TTCCCCTTCA GGACGTCTAC AAGATCGGTG GTATTGGCAC 700 

TGTGCCCGTC GGCCGTATCG AGACCGGTAT CCTCAAGCCC GGTATGGTCG 750 

TCACCTTCGC TCCCTCCAAC GTCACCACTG AAGTCAAGTC CGTCGAGATG 8 00 

CACCACGAGC AGCTTACCGA GGGTGTCCCC GGTGACAACG TTGGTTTCAA 850 

CGTGAAGAAC GTCTCCGTCA AGGATATCCG CCGTGGCAAC GTCTGCGGTG 900 

40 ACTCCAAGAA CGACCCCCCC GCTGCTGCCG CCTCTTTCCA GGCCCAGGTC 950 

ATTGTCCTCA ACCACCCCGG CCAGATCGGT GCTGGTTACG CTCCCGTTCT 1000 

TGACTGCCAC ACTGCCCACA TTGCTTGCAA GTTCGCCGAG CTCCTTGAGA 1050 

AGATCGACCG CCGTACCGGT AAGTCGGTCG AGAACAACCC CAAGTTCGTC 1100 

AAGTCTGGTG ATGCCGCCAT CGTCAAGATG GTTCCCTCCA AGCCCATGTG 1150 

45 TGTTGAGTCC TTCACCGAGT ACCCCCCTCT CGGTCGTTTC GCCGTCCGTG 12 00 

A 1201 



50 2) INFORMATION FOR SEQ ID NO: 450 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1095 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Rhizopus oryzae 

(B) STRAIN: ATCC 56015 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450 

5 

CAAGCTTAAG GCTGAACGTG AACGTGGTAT CACCATCGAT ATTGCTCTCT 50 

GGAAGTTCGA AACCCCCAAG TACCAAATTA CCGTTATTGA TGCTCCCGGT 100 

CACCGTGATT TCATCAAGAA CATGATTACC GGTACTTCTC AAGCCGATTG 150 

TGCTATTCTT ATCATTGCTG GTGGTACTGG TGAATTCGAA GCTGGTATCT 200 

10. CCAAGGATGG TCAAACCCGT GAACACGCCC TTTTGGCTTT CACTCTCGGT 250 

GTCCGTCAAT TGATTGTTGC TGTCAACAAG ATGGATACCA CCAAGTGGTC 3 00 

CGAAGCTCGT TTCAACGAAA TYGTCAAGGA AGTTTCTTCC TTCATCAAGA 350 

AGATTGGTTA CAACCCCAAG TCTGTTCCCT TCGTCCCCAT CTCTGGTTGG 400 

CACGGTGACA ACATGTTGGA AGAATCTACC AACATGCCCT GGTACAAGGG 450 

15 ATGGAACAAG GAAACCAAGG CTGGTGCCAA GTCTGGTAAG ACCCTCTTGG 500 

ATGCCATTGA CAACATTGAC CCTCCTACCC GTCCTGTTGA CAAGCCTCTC 550 

CGTCTTCCTC TTCAAGATGT TTACAAGATT GGTGGTATCG GTACTGTCCC 600 

CGTCGGTCGT GTCGAAACTG GTGTCATCAA GGCTGGTATG GTTGTCACCT 650 

TCGCTCCTGC TGCTGTCACC ACTGAAGTTA AGTCCGTCGA AATGCACCAC 700 

20 GAAACCCTCA CTGAAGGTCT CCCCGGTGAC AACGTCGGTT TCAACGTCAA 750 

GAACGTCTCC GTCAAGGATA TCCGTCGTGG TAACGTCTGT TCTGACTCCA 800 

AGAACGACCC CGCCAAGGAA GCCGGTTCCT TCACCGCTCA AGTCATTATC 850 

TTGAACCACC CTGGTCAAAT TGGTGCTGGT TACGCTCCYG TTTTGGATTG 900 

TCACACTGCT CACATTGCCT GTAAGTTCGC TGAATTGATC GAAAAGATTG 950 

25 ACAGACGTTC CGGTAAGTCC TTGGAAGCTA CTCCCAAGTT CGTCAAGTCT . 1000 

GGTGACTCTG CCATCGTCAA GATGATCCCC TCCAAGCCCA TGTGTGTTGA 1050 

AGCTTACACT GACTACCCTC CTCTCGGTCG TTTCGCTGTT CGTGA 1095 



30 



40 



2) INFORMATION FOR SEQ ID NO: 451 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1092 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rhodotorula minuta 

(B) STRAIN: ATCC 10658 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451 

GCTGAAGGCC GAGCGAGAGC GTGGTATCAC CATCGATATC GCTCTATGGA 50 

AGTTCGAGAC CCCCAAGTAC AACGTCACCG TCATTGACGC TCCAGGACAT 100 

CGTGATTTCA TCAAGAGTGA GTTAACCATA ACATCAAACA GTGTTGCAAA 150 

50 CATCAGCTAA TGCATGTTAT GCGTCCAGAC ATGATTACTG GTACTTCCCA 200 

GGCCGATTGC GCTATTCTCA TCATCGCCAC CGGTGTTGGT GAGTTCGAGG 250 

CTGGTATCTC CAAGGATGGC CAGACCCGAG AGCACGCCCT TCTCGCCTTC 300 

ACCCTCGGTG TCAGACAGCT CATCGTTGCC TTGAACAAGA TGGACTCGGT 350 

CAAGGTAGGC TAACTTCACA ACGTCGGCTT CCCATCATTC ATTCACTTAC 400 

55 CTGTCTTGTC TTCCACCCTC CAGTTCTCCG AGTCCCGATA CGATGAAATC 450 

GTCAAGGAGA CATCCGGTTT CATCAAGAAG GTCGGATTCG ACCCCAAGGG 500 

TGTTCCCTTC GTCCCCATCT CAGGATGGCA CGGAGACAAC ATGATCGAGG 550 

AGTCCACCAA CATGCCTTGG TACAAGGGAT GGAAGAAGAC CACCAAGACC 600 

GGCGAGTACA AGGGAAAGAC CCTGCTCGAG GCCATCGACT CCATCGAGCC 650 

60 CCCCACCCGT CCTACCGACA AGCCTCTCCG ACTTCCCCTC CAGGATGTCT 700 
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ACAAGATTGG TGGTATCGGA ACAGTGCCAG TCGGACGAGT CGAGACTGGT 750 

ACCATCAAGG CTGGTATGGT CGTCACCTTC GCTCCTTCAG CTGTCACCAC 800 

CGAAGTCAAG TCTGTTGAGA TGCACCACGA GCAGCTCGAG GCTGGTCTTC 850 

CAGGTGACAA CGTCGGATTC AACATCAAGA ACGTTTCAGT CAAGGATATC 900 

5 CGAAGAGGAA ACGTCTGCGG TGACTCCAAG AACGATCCCC CCAAGGAGGC 950 

TGCTTCCTTC ACCGCCCAGG TCATTGTCCT CAACCACCCC GGTCAAATCG 1000 

GTAACGGATA CTCTCCAGTT CTCGATTGCC ACACTGGTGA GTCATTCTTC 1050 

CATATTAGTT TGAACTCTTT TGAACAATAC TAACGTGAAT CATTATACTT 1100 

TTCAGCCCAC ATTGCATGCA AGTTCGACAC CCTCCTAGAG AAGATTGACC 1150 

10 GACGATCCGG AAAGTCCATC GAAGATACCC CCAAGTTCGT CAAGTCTGGT 12 00 

GACGCCGCCA TCGTCAAGAT GGTCCCCACC AAGCCAATGT GCGTTGAGGC 1250 

TTTCACCGAC TACCCACCTC TTGGACGATT CGCCGTCCGT GA 12 92 



15 



25 



2) INFORMATION FOR SEQ ID NO: 452 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1289 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporobolomyces salmonicolor 

(B) STRAIN: ATCC 32311 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452 

AAGCTCAAGG CCGAGCGTGA GCGTGGTATC ACCATCGATA TCGCTCTCTG 50 

GAAGTTCGAG ACCCCCAAGG TGCGTTCTCA CCCCGGCTGA GGAGCACGCA 100 

CGCGAGGGCT CACGCTGCGC CTCTTACAGT ACATGATCAC CGTCATCGAC 150 

3 5 GCCCCGGGTC ACCGTGACTT CATCAAGAAC ATGATCACCG GTACCTCGCA 200 

GGCCGACTGC GCCATCCTCA TCATCGCCGC CGGTACCGGT GAGTTCGAGG 250 

CTGGTATCTC GAAGGACGGC CAGACCCGCG AGCACGCCCT CCTCGCCTTC 300 

ACCCTCGGTG TCCGTCAGCT CATCGTCGCC ATCAACAAGA TGGACACGAC 350 

CAAGTACTCG GAGGCCCGTT TCGAGGAGAT CATCAAGGAG ACCTCCAACT 400 

4 0 TCAT CAAGAA GGTCGGCTTC AACCCCAAGA ACGTCCCCTT CGTCCCCATC 450 

TCGGGATGGC ACGGTGACAA CATGATTGAG GAGACCGCCA ACATGCCCTG 500 

GTACAAGGGA TGGAAGAAGG AGACCAAGGC CGGTGAGGTC AAGGGCAAGA 550 

CCCTCCTCGA CGCCATCGAC GCGATCGAGC CCCCTTCGCG CCCTACCGAC 600 

AAGCCCCTCC GTCTTCCCCT CCAGGTTCGT TTCCCTGCTC GCGGTTTACG 650 

4 5 CTGCTACTTC GAGCTGACCC GCGAGCTCTG CCCGAACAGG ATGTCTACAA 700 

GATCGGTGGT ATCGGCACAG TCCCCGTCGG CCGTGTCGAG ACCGGCACGA 750 

TCAAGGCCGG TATGGTCGTC GTCTTCGCCC CGGCCAACGT CACCACTGAG 800 

GTCAAGTCGG TCGAGATGCA CCACGAGCAG CTCGAGGCTG GTCTCCCGGG 850 

AGACAACGTC GGCTTCAAGT GCGTCTCATC ATGTTTTTGC TTCGCTCGGC 900 

50 CATTTTTTCA GTCCTGACCC CGTTTTGCCC CTCGACAGCG TCAAGAACGT 950 

TTCCGTTAAG GACATCCGTC GCGGTAACGT CTGCGGTGAC TCGAAGAACG 1000 

ACCCCCCCAA GGAGGCCGCT TCCTTCAAGG CCCAGGTCAT CGTCATGAAC 1050 

CACCCCGGTC AGATCGGCAA CGGTTACGCT CCCGTTCTCG ACTGCCACAC 1100 

CGCCCACATT GCCTGCAAGT TCGACACCCT CCTCGAGAAG ATCGACCGTC 1150 

55 GCTCGGGCAA GTCGATTGAG GACCTCCCCA AGTTCGTCAA GTCGGGTGAC 1200 

GCCGCCATCG TCAAGATGGT TCCCTCCAAG CCGATGTGTG TCGAGTCGTT 1250 

CGCCGAGTAC CCCCCTCTCG GACGTTTCGC CGTCCGTGA 1289 



60 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1070 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE : Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporothrix schenckii 

(B) STRAIN: WSA-14 8 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453 

GTGAGCGCGG TATCACCATC GATATTGCTC TGTGGAAGTT CGAGACCCCC 50 

AAGTACTACG TCACCGTCAT TGACGCCCCC GGTCATCGCG ATTTCATCAA 100 

GAACATGATC ACTGGTACCT CGCAGGCCGA CTGCGCCATT CTCATCATTG 150 

20 CCGCTGGTAC TGGTGAGTTC GAGGCTGGTA TCTCCAAGGA TGGCCAGACT 2 00 

CGTGAGCACG CTCTGCTCGC CTACACCCTG GGTGTGCGGC AGCTGATCGT 250 

CGCCATCAAC AAGATGGACA CGGCCAAGTG GGCTGAGGCT CGTTACCAGG 3 00 

AGATCATCAA GGAGACCTCC AACTTCATCA AGAAGGTCGG CTACAACCCC 3 50 

AAGACTGTTG CCTTCGTCCC CATCTCGGGC TTCCACGGCG ACAACATGCT 400 

25 TACTCCCTCG ACCAACTGCC CCTGGTACAA GGGCTGGGAG AAGGAGGGCA 4 50 

AGAGCGGCAA GGTTACCGGT AAGACTCTGC TGGACGCCAT TGACGCCGTC 500 

GAGCCCCCCA AGCGCCCCAC GGACAAGCCC CTGCGTCTGC CCCTCCAGGA 550 

TGTCTACAAG ATCGGCGGTA TCGGCACTGT . CCCTGTCGGC CGTATCGAGA 600 

CTGGTGTCCT GAAGCCCGGC ATGGTCGTCA CCTTTGCCCC GTCCAACGTC 650 

30 ACCACTGAAG TCAAGTCCGT CGAGATGCAC CACGAGCAGC TTGTTGAGGG 700 

TGTTCCCGGC GACAACGTCG GCTTCAACGT CAAGAACGTC TCCGTCAAGG 750 

AGATCCGTCG TGGCAACGTT GCCGGTGACT CCAAGAACGA CCCCCCCTCG 800 

GGCGCCGCCA CCTTCAACGC CCAGGTCATT GTCCTGAACC ACCCCGGCCA 850 

GGTCGGCAAC GGCTACGCCC CGGTTCTGGA CTGCCACACC GCCCACATTG 900 

35 CCTGCAAGTT CACCGAGATC CTTGAGAAGA TCGACCGCCG TACCGGCAAG 950 

TCGGTTGAGA ACAACCCCAA GTTCATCAAG TCGGGTGACG CCGCCATTGT 1000 

CAAGCTGACG CCCTCGAAGC CCATGTGCGT TGAGGCCTTC ACTGACTACC 1050 

CCCCTCTGGG CCGTTTCGCC 1070 

40 

2) INFORMATION FOR SEQ ID NO: 454 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 10 92 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Stephanoascus ciferrii 

(B) STRAIN: ATCC 52550 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 454 



CTTAAGTCTG AGCGTGAGAG AGGTATCACC ATCGATATTG CTCTCTGGAA 50 

ATTCGAGACT CCTAAGTACA ACGTTACCGT CATTGATGCT CCAGGTCACA 100 

60 GAGATTTCAT CAAGAACATG ATTACTGGTA CCTCCCAGGC CGATCTTGCC 150 

276 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/C A00/01 150 

ATCCTTATCA TTGCTGGTGG TGTCGGTGAG TTCGAGGCTG GTATCTCCAA * 23tf 

GGACGGTCAG ACCAGAGAGC ACGCTCTTCT TTCTTTCACC CTTGGTGTCA 250 

GAAACATGAT TGTTGCTGTC AACAAGATGG ACTCCGTCAA GTGGTCTGAG 300 

GATCGTTTCA ACGAAATTGT CAAGGAGACC TCCAACTTCG TCAAGAAGGT 350 

5 TGGTTACAAC CCTAAGAATA TTGCTTTCGT TCCTATCTCC GGTTGGAACG 400 

GTGACAATAT GATTGAGCCA TCCACCAACT GCCCATGGTA CAAGGGTTGG 450 

GAGCGTGAGA CCAAGAACGG TACTGCCAAG GGTAAGACCA TCTTGGAGGC 500 

CATTGACTCT ATGGAGCCAC CTTCCAGACC AGTTGACAAG CCTCTCCGTC 550 

TTCCTCTTCA GGACGTTTAC AAGATTGGTG GTATTGGTAC GGTGCCAGTT 600 

10 GGTCGTGTTG AGACTGGTGT TATCAAGCCA GGTATGGTTG TTACCTTTGC 650 

CCCAGCTGGT GTCACCACTG AAGTCAAGTC TGTCGAGATG CACCACGAAC 700 

AGATCCCAGA AGGTACCCCA GGTGACAACG TTGGTTTCAA CGTCAAGAAC 750 

GTCTCCGTCA AGGAAATCAG ACGTGGTAAC GTTACTGGTG ACTCCAAGAA 800 

CGACCCACCA AAGGGCTGCG ACTCTTTCAA CGCTCAGGTC ATCATCTTCA 850 

15 ACCACCCTGG TCAGATCTCT GCTGGTTACG CTCCAGTTTT GGACTGCCAC 900 

ACTGCTCACA TTGCTTGCAA GTTTGAGGAG CTCATTGAGA AGATTGACAG 950 

ACGTTCCGGT AAGAAGGTCG AAGACTCTCC TAAGTTCGTC AAGGCCGGTG 1000 

ATGCCGCCAT TGTCAAGATG GTTCCATCCA AGCCAATGTG TGTTGAAACC 1050 

TTCACTGAGT ACCCACCTCT TGGTCGTTTC GCCGTCCGTG AC 10 92 

20 



2) INFORMATION FOR SEQ ID NO: 455 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1149 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 
35 (B) STRAIN: WSA-225 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455 

GCCGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCCCTCT GGAAGTTCGA 50 

40 GACCCCCAAG TACAATGTCA CCGTCATTGG TATGTTTTCT CTTTACCTTT 100 

CCCCTCCATC GTCTTGCTGT GCCATAACTA ACGAGAGTAG ACGCCCCCGG 150 

TCACCGTGAC TTCATCAAGA ACATGATCAC TGGTACCTCC CAGGCCGACT 200 

GTGCTATTCT CATCATTGCT GCCGGTACTG GTGAGTTCGA GGCTGGTATC 250 

TCCAAGGATG GCCAGACCCG TGAGCACGCT CTGCTCGCCT TCACCCTTGG 300 

45 TGTCAAGCAG CTCATCGTTG CCATCAACAA GATGGACACC ACCAACTGGT 350 

CCGAGGACCG TTTCAAGGAA ATCATCAAGG AAGTCACCAA CTTCATCAAG 400 

AAGGTTGGCT ACGACCCCAA GGGTGTTCCA TTCGTTCCAA TCTCTGGTTT 450 

CAACGGTGAC AACATGATTG AGGCCTCCAC CAACTGCCCA TGGTACAAGG 500 

GATGGAACAA GGAGACTAAG GCCGGTGGTG CCAAGACTGG CAAGACCCTC 550 

50 CTCGAGGCCA TCGATGCCAT CGACATGCCA ACCCGTCCTA CCGACAAGCC 600 

CCTCCGTCTC CCACTCCAGG ATGTCTACAA GATCTCTGGT ATCGGAACTG 650 

TACCAGTCGG TCGTGTTGAG ACCGGTATCA TCAAGCCCGG TATGGTCGTC 700 

ACCTTCGCCC CTGCCAACGT CACCACTGAA GTCAAGTCCG TCGAAATGCA 750 

CCACCAGCAG CTTCAGCAGG GTGTCCCCGG TGACAACGTC GGCTTCAACG 800 

55 TCAAGAACGT TTCCGTCAAG GAAGTCCGCC GTGGTAACGT TGCCGGTGAC 850 

TCCAAGAACG ACCCACCATC CGGCTGTGCC TCCTTCAACG CCCAGGTCAT 900 

CGTCCTCAAC CACCCCGGCC AGATCGGTGC TGGTTACGCC CCAGTCCTCG 950 

ACTGCCACAC TGCTCACATT GCTTGCAAGT TCGCTGAGCT CCTCGAGAAG 1000 

ATTGACCGCC GTACCGGTAA ATCCGTCGAA GCCAACCCCA AGTTCGTCAA 1050 

60 GTCTGGTGAT GCCGCTATCG CCAAGATGGT TCCCTCCAAG CCTATGTGCG 1100 
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5 2) INFORMATION FOR SEQ ID NO: 456 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichosporon cutaneum 

(B) STRAIN: ATCC 62965 



20 



45 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456 



TCTTGACAAG CTTAAAGCTG AACGTGAACG TGGTATCACC ATTGATATCG 50 

CTCTCTGGAA GTTCGAAACT CCTAAGTACT ACGTTACTGT TATTGATGCT 100 

CCAGGTCACC GTGATTTCAT CAAGAACATG ATTACTGGTA CTTCCCAAGC 150 

CGACTGCGCC ATTCTTATCA TTGCTGCCGG TGTCGGTGAA TTCGAAGCTG 200 

2 5 GTATCTCCAA GGAAGGTCAA ACCAGAGAAC ACGCTCTTCT CGCTTTCACC 250 

CTTGGTGTCA GACAACTTAT CATTGCCATC AACAAGATGG ACTCTGTCAA 300 

GTGGGACCAA AAGAGATACG AAGAAATCGT CAAGGAGGCT TCCAACTTCG 3 50 

TCAAGAAGGT TGGTTACAAC CCCAAGTCTG TTCCATTCGT TCCTATCTCT 400 

GGTTGGAACG GTGACAACAT GTTGGAACCT ACCACCAACG CCCCATGGTA 4 50 

30 CAAGGGATGG ACCAAGGAAA CCAAGGCTGG TGCCACTAAG GGTATGACTC 500 

TTATTGAAGC CATTGACGCC ATTGAACCAC CAGTAAGACC ATCCGACAAG 550 

CCACTCCGTC TCCCACTCCA AGATGTTTAC AAGATTGGTG GTATCGGAAC 600 

TGTGCCAGTC GGCCGTGTCG AAACCGGTAT CATCAAGGCC GGTATGGTCG 650 

TCACCTTTGC TCCACCAATG GTCACAACTG AAGTTAAGTC CGTTGAAATG 700 

35 CACCACGAAC AACTTGCTCA AGGTAACCCA GGTGACAACG TTGGTTTCAA 750 

CGTCAAGAAC GTTTCCGTTA AGGAAATCAG ACGTGGTAAC GTCTGTGGTG 800 

ACTCCAAGAA CGATCCACCA AAGGGCTGCG AATCTTTCAA CGCTCAAGTT 850 

ATCGTCTTGA ACCACCCTGG TCAAATCTCT GCTGGTTACT CTCCAGTTCT 900 

CGATTGCCAC ACTGCCCACA TTGCCTGCAG ATTCGACGAA CTCCTTGAAA 950 

40 AGATCGACCG TCGTTCCGGT AAGAAGATTG AAGACTCTCC AAAGTTTGTC 1000 

AAGTCTGGTG ATGCCGCTAT CGTCAAGATG ATCCCAACCA AGCCAATGTG 1050 

CGTTGAAACC TTCACTGAAT ACCCACCACT TGGTCGTTTC GCCGTCCGTG 1100 

A 1101 



2) INFORMATION FOR SEQ ID NO: 457 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 1085 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Wangiella dermatitidis 

(B) STRAIN: WSA-229 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457 



PCT/CAOO/01150 



GTTGAAGGCC GAGCGTGAGC GTGGTATCAC CATCGATATC GCCCTCTGGA 50 

AGTTCGAGAC CCCCAAGTAC TATGTCACCG TCATCGACGC CCCGGGTCAT 100 

5 CGTGACTTTA TCAAGAACAT GATCACTGGT ACCTCGCAGG CCGACTGCGC 150 

CATCTTGATC ATTGCCGCCG GTACCGGTGA ATTCGAAGCC GGTATCTCCA 2 00 

AGGATGGTCA GACCCGTGAG CACGCTCTGC TCGCCTACAC CTTGGGTGTC 250 

AAGCAGCTCA TCGTCGCCAT CAACAAGATG GACACCACCA AGTGGTCCGA 300 

GGAGCGTTTC AACGAAATCA TCAAGGAGAC TTCCAACTTC ATCAAGAAGG 350 

10 TCGGCTACAA CCCCAAGGCC GTTCCTTTCG TCCCCATCTC CGGCTTCAAC 400 

GGTGACAACA TGATTGAGGT CTCCACCAAC TGCCCGTGGT ACAAGGGATG 450 

GGAGAAGGAG TCCAAGGCTG GCAAGGCCAC CGGCAAGACC CTCCTCGAGG 500 

CCATTGACGC CATCGACCCA CCCACCCGTC CCACCGACAA GCCTCTCCGT 550 

CTCCCTCTCC AGGATGTCTA CAAGATCTCT GGTATCGGAA CGGTTCCTGT 600 

15 CGGTCGTGTC GAGACCGGTA CCATCAAGGC CGGTATGGTC GTCACCTTCG 650 

CTCCGGCCAA CGTCACCACT GAAGTCAAGT CCGTCGAAAT GCACCACGAG 700 

CAGCTCGCCG AGGGTCTGCC AGGTGACAAC GTTGGCTTCA ACGTCAAGAA 750 

CGTCTCCGTC AAGGAGGTTC GTCGTGGTAA CGTTGCCGGT GACTCCAAGA 800 

ACGACCCGCC CAAGGGTGCC GAGTCCTTCA ACGCCCAGGT CATTGTCCTC 850 

2 0 AACCACCCTG GTCAGATCGG TGCCGGCTAC GCTCCAGTCT TGGATTGCCA 900 

CACTGCCCAC ATTGCTTGCA AGTTCGCCGA GTTGCTCGAG AAGATCGACC 950 

GTCGTACCGG AAAGTCCATC GAGAACAACC CCAAGTTCAT CAAGTCTGGT 1000 

GATGCTGCCA TCGTCAAGAT GATTCCCAGC AAGCCCATGT GTGTCGAGGC 1050 

TTTCACCGAC TATCCTCCTC TGGGTCGTTT CGCTG 1085 

25 



2) INFORMATION FOR SEQ ID NO: 458 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 492 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 
40 (B) STRAIN: DAL-95 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458 

TGTCTTCATC CGGAATTGAT TGTGAGTCGT TCCACATGCT CACCTAGTTT 50 

45 TCGCTCGATC TTTTCACTAA CGCAAACCAT GTAGAACAAC ATTGCCAAGG 100 

CCCACGGTGG TTACTCCGTC TTCACTGGTG TTGGTGAGCG TACTCGTGAG 150 

GGTAACGATC TGTACCACGA AATGCAGGAG ACTGGTGTCA TTCAGCTCGA 200 

GGGTGAATCC AAGGTCGCAC TGGTGTTCGG ACAGATGAAC GAGCCCCCCG 250 

GTGCCCGTGC CCGTGTCGCC CTTACCGGTC TGACCATTGC CGAGTACTTC 3 00 

50 CGTGACGAGG AGGGTCAGGA CGTGCTGCTC TTCATTGACA ACATTTTCCG 350 

TTTCACCCAG GCCGGTTCTG AGGTGTCTGC CCTTCTCGGT CGTATCCCCT 4 00 

CTGCCGTCGG TTACCAGCCC ACCCTGGCCG TCGACATGGG TGGTATGCAG 450 

GAGCGTATCA CCACCACCAA GAAGGGTTCT ATTACCTCCG TC 4 92 

55 

2) INFORMATION FOR SEQ ID NO: 459 

(i) SEQUENCE CHARACTERISTICS : 
60 (A) LENGTH: 1154 bases 

279 



SUBSTITUTE SHEET (RULE 26) 



10 



WO 01/23604 PCT/CA00/01 150 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastoschizomyces capitatus 

(B) STRAIN: ATCC 10663 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459 



GTCCGTGGTC AAGAAGTTAT TGACACTGGT GCCCCAATTA CCATTCCTGT 50 

TGGTCGTGGT ACTCTTGGTA GAATTATCAA CGTCATTGGT GAACCAATTG 100 

15 ACGAACGTGG TCCTATCAAG GCTTCTAAGT ATGCTCCTAT CCATACTGAA 150 

CCACCAACCT TTGCTGAACA ATCTACTTCT GCTGAAGTTC TTGAAACTGG 2 00 

TATCAAGGTT GTCGATCTTC TTGCTCCTTA CGCCCGTGGT GGTAAGATTG 250 

GTCTTTTCGG TGGTGCTGGT GTCGGTAAGA CTGTCTTCAT TCAAGAACTT 3 00 

ATTAACAACA TTGCCAAGGC TCACGGTGGT TTCTCTGTCT TCACTGGTGT 350 

2 0 CGGTGAAAGA ACCCGTGAAG GTAACGATCT TTACCGTGAA ATGAAGGAAA 4 00 

CTGGTGTCAT CAACCTCGAA GGTGACTCCA AGGTCGCTCT CGTTTTCGGT 450 

CAAATGAACG AACCTCCAGG TGCCCGTGCC CGTGTCGCTT TGACTGGTCT 500 

TACCATTGCC GAATACTTCC GTGATGAAGA AGGACAAGAT GTCTTGCTTT 550 

TCGTTGACAA CATTTTCAGA TTCACCCAAG CCGGTTCTGA AGTCTCTGCT 600 

25 CTTTTGGGTC GTATTCCATC TGCCGTCGGT TACCAACCTA CCCTTGCTAC 650 

CGATATGGGT GCCCTCCAAG AACGTATTAC CACCACCCAA AAGGGTTCCG 700 

TCACATCTGT CCAAGCCGTC TATGTCCCAG CAGACGATTT GACTGATCCT 750 

GCCCCAGCCA CCACTTTCGC TCACTTGGAC GCCACCACTG TCTTGTCTCG 800 

TTCCATTTCC GAATTGGGTA TCTACCCAGC TGTCGATCCT CTCGATTCCA 850 

3 0 AGTCTCGTCT TTTGGATCCT GAAGTTATTG GACACGAACA CTACGAAGTT 900 

GCCACTCAAG TTCAACAAAC CCTCCAAGCT TACAAGTCTC TCCAAGATAT 950 

CATTGCCATT TTGGGTATGG ATGAATTGTC TGAAGCTGAT AAGCTTACTG 1000 

TCGAACGTGC CCGTAAGATC CAAAGATTCC TTTCCCAACC ATTCGCTGTT 1050 

GCCGAAGTTT TCACTGGTAT CGAAGGTCGT CTCGTTCCAT TGAAGGAAAC 1100 

35 CGTCAGATCT TTCAAGGAAA TCCTTGAAGG TAAGTACGAT CACCTTCCAG 1150 

AAGC 1154 



40 2) INFORMATION FOR SEQ ID NO: 460 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 95 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 18804 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460 



CCAATTCGAC GAAGGAAACT TGCCAGCTAT TTTGAATGCT TTGACTTTGA 50 

AGAACGGTGA CCAAGACTTG GTCTTGGAAG TTGCTCAACA TTTGGGTGAA 100 

AACACCGTCA GAGCTATTGC TATGGATGGT ACTGAAGGTT TAGTCAGAGG 150 

TACCGAAGTC AACGATACCG GTGCCCCAAT CTCCGTCCCA GTCGGTAGAG 200 

60 GTACTTTAGG TAGAATCATC AATGTTGTTG GTGAACCAAT TGATGACAGA 250 

280 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

GGTCCAATTG AATGTAAGGA AAAGAAACCA ATTCACGCTG AACCACCATC 300 

ATTCGTTGAA CAATCCACTG CTGCCGARAT TTTGGAAACC GGTATCAAGG 350 

TTGTCGACTT GTTGGCCCCA TACGCCAGAG GTGGTAARAT TGGTTTATTC 4 00 

GGTGGTGCTG GTGTCGGTAA GACCGTCTTT ATCCAAGAAT TGATTAACAA 450 

5 CATTGCCAAA GCCCATGGTG GTTTCTCTGT CTTTACCGGT GTYGGTGAAA 500 

GAACCAGAGA AGGTAACGAT TTGTACCGTG AAATGAAAGA AACCGGTGTC 550 

ATCAACTTGG AAGGTGACTC CAAGGTCGCT TTGGTCTTCG GTCAAATGAA 600 

CGAACCACCA GGTGCTAGAG CTAGAGTTGC TTTGACTGGT TTGACCATTG 650 

CTGAATACTT CAGAGATGAA GAAGGTCAAG ATGTCTTGTT GTTCATTGAT 700 

10 AACATTTTCA GATTCACCCA AGCTGGTTCC GAAGTGTCTG CTTTGTTAGG 750 

TCGTATTCCA TCTGCTGTCG GTTATCAACC AACCTTAGCC ACTGATATGG 800 

GTCTTTTGCA AGAACGTATT ACCACCACCA AGAAAGGTTC CGTCACCTCT 850 

GTCCAAGCTG TCTATGTCCC AGCTGATGAT TTGACCGATC CTGCTCCAGC 900 

CACTACATTC GCCCATTTGG ATGCCACTAC TGTCTTGTCT AGAGGTATTT 950 

15 CTGAATTGGG TATCTACCCA GCTGTCGATC CATTGGATTC CAAATCCAGA 100 0 

TTATTGGACG CTTCTGTTGT TGGTCAAGAA CATTACGATG TCGCTACTGG 1050 

TGTTCAACAA ACTTTACAAG CTTACAAATC CTTACAAGAT ATCATTGCTA 110 0 

TTTTGGGTAT GGATGAATTG TCTGAAGCTG ATAAATTGAC TGTCGAAAGA 1150 

GCCCGTAAGA TCCAAAGATT CTTGTCTCAA CCATTCGCTG TTGCTGAAGT 12 00 

20 TTTGACTGGT ATCCCAGGTA GATTAGTCAG ATTGCAAGAC ACTGTCAAAT 1250 

CATTCAAGGA TGTTTTGGAA GGTAAATACG ATAACTTGCC AGAAA 1295 



25 2) INFORMATION FOR SEQ ID NO: 461 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 77 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dubliniensis 

(B) STRAIN: NCPF 3949 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 461 



TAACTTGCCA GCTATTTTGA ATGCTTTGAC TTTGAAGAAC GGTGACCAAG 50 

ATTTAGTTTT GGAAGTTGCT CAACATTTGG GTGAAAACAC CGTCAGAGCT 10 0 

ATTGCTATGG ATGGTACTGA AGGTTTAGTC AGAGGTACTG AAGTCAACGA 150 

TACCGGTGCC CCAATCTCCG TTCCAGTCGG TAGAGGTACC TTAGGTAGAA 200 

45 TCATCAATGT TGTTGGTGAA CCAATTGATG ACAGAGGTCC AATTGAATGT 250 

AAGGAAAAGA AACCAATTCA TGCAGAACCA CCATCCTTCG TTGAGCAATC 3 00 

CACTGCTGCC GAAATTTTGG AAACCGGTAT CAAGGTTGTC GACTTATTGG 350 

CCCCATACGC CAGAGGTGGT AAGATTGGTT TGTTCGGTGG TGCTGGTGTC 400 

GGTAAGACCG TCTTTATCCA AGAATTGATT AACAACATTG CTAAAGCCCA 450 

50 TGGTGGTTTC TCCGTCTTTA CCGGTGTCGG TGAAAGAACC AGAGAAGGTA 500 

ACGATTTGTA CCGTGAAATG AAAGAAACCG GTGTCATCAA CTTAGAAGGT 550 

GACTCCAAGG TCGCTTTGGT CTTTGGACAA ATGAACGAAC CACCAGGTGC 600 

TAGAGCTAGA GTTGCTTTGA CTGGTTTGAC TATTGCTGAA TACTTCAGAG 650 

ATGAAGAAGG TCAAGATGTC TTGTTGTTCA TCGATAACAT TTTCAGATTC 700 

55 ACCCAAGCTG GTTCCGAAGT GTCTGCTTTG TTAGGTCGTA TTCCATCTGC 750 

CGTCGGTTAT CAACCAACCT TAGCTACTGA TATGGGTCTT TTGCAAGAAC 800 

GTATTACCAC CACCAAGAAA GGTTCCGTCA CCTCTGTCCA AGCTGTCTAT 850 

GTCCCAGCTG ATGATTTGAC CGATCCTGCT CCAGCCACCA CATTCGCCCA 900 

TTTGGATGCC ACTACTGTCT TGTCTAGAGG TATTTCTGAA TTGGGTATTT 950 

60 ACCCAGCTGT CGATCCATTG GATTCCAAAT CCAGATTATT GGACGCTGCC 1000 

281 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

GTTGTTGGTC AAGAACATTA TGATGTCGCT 
GCAAGCTTAC AAATCCTTAC AAGATATCAT 
AATTGTCTGA AGCTGATAAA TTGACTGTCG 
AGATTCTTGT CTCAACCATT CGCCGTTGCT 
5 AGGTAGATTA GTCAGATTGC AAGACACTGT 
TGGAAGGTAA ATACGATCAC TTGCCAG 



PCT/CA00/01150 

ACTGGTGTTC AACAAACT&T 1050 
TGCTATTTTG GGTATGGATG 1100 
AAAGAGCCCG TAAGATTCAA 1150 
GAAGTTTTCA CTGGTATTCC 1200 
CAAATCATTC AAGGATGTTT 1250 

1277 



10 2) INFORMATION FOR SEQ ID NO: 462 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1278 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida famata 

(B) STRAIN: ATCC 62894 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462 



GTAACTTGCC AGCTATTTTG AACGCTTTGA CCTTGAAGAA CGGTGAAAAC 50 

GACTTAGTTT TAGAAGTTGC CCAACATTTG GGTGAAAACA CCGTCAGAGC 100 

TATTGCTATG GATGGTACTG AAGGTTTAGT TAGAGGTACT CCAGTTACCG 150 

ATTCTGGTGC TCCAATTTCT GTCCCAGTCG GTAGAGGTAC TTTAGGTAGA 2 00 

30 ATCTTGAACG TTATTGGTGA ACCAATTGAT GAACAAGGTC CAGTTGATGC 250 

CAAGGAAACC AGACCAATTC ACCAAGACCC ACCAGCATTC GTTGATCAAT 300 

CCACCAAGGC TGAAGTTTTG GAAACTGGTA TCAAGGTTGT CGATTTATTA 350 

GCCCCTTACG CTAGAGGTGG TAAGATTGGT TTATTCGGTG GTGCCGGTGT 400 

CGGTAAGACC GTCTTTATCC AAGAATTGAT TAACAACATT GCCAAGGCCC 450 

3 5 ATGGTGGTTT CTCTGTTTTC ACTGGTGTCG GTGAAAGAAC CAGAGAAGGT 500 

AACGATTTAT ATAGAGAAAT GAAGGAAACT GGTGTCATTA ACTTGGAAGG 550 

TGACTCCAAG GTCGCCTTGG TTTTCGGTCA AATGAACGAA CCACCAGGTG 600 

CTAGAGCTAG AGTTGCTTTA ACCGGTTTAA CCATTGCCGA ATACTTCAGA 650 

GACGAAGAAG GTCAAGATGT GTTATTGTTC GTCGATAACA TTTTTAGATT 700 

40 CACCCAAGCC GGTTCCGAAG TGTCTGCTTT GTTAGGTCGT ATTCCATCGG 750 

CTGTCGGTTA TCAACCAACC TTAGCCACTG ATATGGGTCT TTTACAAGAA 800 

AGAATTACCA CCACCACCAA GGGTTCCGTT ACTTCTGTCC AAGCTGTCTA 850 

CGTCCCAGCC GATGATTTAA CCGATCCTGC TCCAGCTACC ACTTTCGCCC 900 

ACTTGGATGC TACCACTGTG TTGTCTCGTG GTATCTCTGA ATTGGGTATT 950 

45 TACCCAGCTG TCGATCCATT GGATTCCAAA TCCAGATTGT TAGATGCTGA 1000 

TATCGTTGGT AAAGAACACT ACGAAGTTGC CACTGGTGTC CAACAAACCT 1050 

TACAAGCTTA CAAATCTTTA CAAGATATCA TTGCTATTTT AGGTATGGAT 1100 

GAATTGTCTG AAGCCGATAA ATTGACTGTC GAAAGAGCCA GAAAGATCCA 1150 

AAGATTCTTG TCTCAACCAT TCGCCGTTGC CGAAGTTTTC ACCGGTATCC 12 00 

50 CAGGTAGATT AGTTAGATTG CAAGACACTG TTAAATCTTT CAAGGAAGTC 1250 

TTAGAAGGTA AATATGATCA CTTACCAG 1278 



55 2) INFORMATION FOR SEQ ID NO: 463 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1154 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 

282 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(D) TOPOLOGY: Linear 



PCT/CA00/01150 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida glabra ta 

(B) STRAIN: ATCC 66032 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463 

TCTGGTCAGA GGCGAGAAGG TCGTCGACAC AGGTGCCCCA ATCTCCGTCC 50 

CTGTCGGCAG AGAGACCCTG GGCAGAATCA TCAACGTTAT CGGTGAACCT 100 

ATCGACGAGC GTGGCCCAAT CAACTCAAAG TTGAGAAAGC CTATCCACGC 150 

CGACCCTCCT TCCTTCGCAG AACAGTCCAC CGCCGCCGAA GTCCTGGAAA 2 00 

CAGGTATCAA GGTCGTCGAC TTGCTGGCCC CTTACGCCAG AGGTGGTAAG 250 

ATCGGTCTGT TCGGTGGTGC CGGTGTCGGT AAGACCGTGT TCATCCAAGA 300 

ACTGATCAAC AACATCGCAA AGGCTCACGG TGGTTTCTCC GTGTTCACAG 3 50 

GTGTCGGTGA AAGAACCAGA GAAGGTAACG ATTTGTACAG AGAAATGAAG 400 

GAAACCGGTG TCATCAACTT GGAAGGTGAC TCTAAGGTCG CCTTGGTCTT 4 50 

CGGCCAAATG AACGAACCAC CAGGAGCCAG AGCCAGAGTC GCCTTGACCG 500 

GTTTGACCAT CGCAGAATAC TTCAGAGATG AAGAAGGTCA AGATGTCCTG 550 

CTGTTCGTCG ACAACATTTT CAGATTCACC CAAGCCGGTT CAGAAGTCTC 600 

CGCTTTGCTA GGTCGTATCC CATCCGCCGT CGGTTATCAA CCAACCTTGG 650 

CCACCGATAT GGGTCTGTTG CAAGAAAGAA TTACCACCAC AAAGAAGGGT 700 

TCCGTCACTT CCGTCCAAGC CGTCTACGTG CCTGCAGATG ATTTAACAGA 750 

TCCTGCCCCT GCCACTACTT TCGCGCACTT GGACGCCACC ACCGTCTTGT 800 

CCAGAAGTAT CTCAGAATTG GGTATCTACC CAGCTGTCGA CCCATTGGAC 850 

TCCAAGTCTA GATTGC TAGA CGCTGCCGTT GTCGGTGAAG AGCATTACAA 900 

CGTCGCCACA AAGGTCCAAG AAACTTTACA AACTTACAAG TCTCTGCAAG 950 

ATATCATCGC CATTTTGGGT ATGGATGAAT TGTCCGAACA AGATAAGCTA 1000 

ACTGTCGAAA GAGCAAGAAA GATCCAAAGA TTCTTGTCCC AACCATTCGC 1050 

TGTCGCTGAA GTTTTCACCG GTATCGAAGG TAAGCTGGTC AGATTGAAGG 1100 

ACACCATCTC CTCTTTCAAG GCTGTCTTGG AAGGTAAGTA CGATGATCTT 1150 

CCAG 1154 



2) INFORMATION FOR SEQ ID NO: 464 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 93 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida guilliermondii 

(B) STRAIN: ATCC 6260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464 

CCACTACGAG GACGGTAACC TTCCTGCTAT TTTCAACGCC TTGACTCTTA 50 

AGAACGGTGA CCAAAACTTG GTTTTGGAAG TTGCCCAGCA TTTGGGTGAA 100 

AACACCGTCA GAACCATTGC TATGGATGGT ACTGAAGGTT TGGTTAGAGG 150 

TGCCAGCGTC ACTGACACTG GTGCTCCTAT CTCTGTGCCT GTTGGTCGTG 200 

GTACTTTGGG TCGTATCATC AACGTTATTG GTGAGCCAAT TGACGAGCGT 250 

GGACCAATCG AGTCCAAGCA AAAGAAGCCC ATTCACGCTG AACCACCATC 3 00 

GTTCGTCGAA CAATCCACTT CTGCCGAGGT TTTGGAAACC GGTATCAAGG 350 

283 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/C A00/01 150 

TTGTCGACTT GTTGGCTCCA TACGCCAGAG GTGGTAAGAT TGGATTGTTC r "4 00 

GGTGGTGCCG GTGTCGGTAA GACTGTGTTT ATCCAAGAGT TGATTAACAA 450 

CATTGCCAAG GCTCACGGTG GTTTCTCCGT GTTCACCGGT GTCGGTGAAA 500 

GAACCAGAGA AGGTAACGAT TTGTACCGTG AAATGAAGGA AACTGGTGTC 550 

5 ATCAACTTGG AAGGTGAATC CAAGGTGGCC TTGGTGTTCG GTCAAATGAA 600 

CGAACCTCCA GGAGCTAGAG CCAGAGTTGC CCTTACCGGT TTGACCATCG 650 

CTGAATACTT CAGAGATGAG GAGGGTCAAG ATGTGTTGTT GTTCGTCGAC 700 

AACATTTTCA GATTCACTCA AGCTGGTTCT GAAGTGTCGG CTTTGTTGGG 750 

TCGTATTCCT TCGGCTGTCG GTTACCAACC TACTTTGGCC ACCGATATGG 800 

10 GTTTGTTGCA AGAGCGTATT ACCACCACCA AGAAGGGTTC CGTCACCTCT 850 

GTCCAAGCTG TCTATGTGCC AGCCGATGAT TTGACCGATC CTGCTCCTGC 900 

TACTACTTTT GCTCACTTGG ATGCTACCAC TGTGTTGTCT AGAGGTATCT 950 

CCGAGTTGGG TATTTACCCA GCTGTCGATC CTTTGGATTC CAAGTCGAGA 1000 

TTGTTGGATG CCTCTGTTGT CGGTGAGGAG CACTACTCGG TTGCTTCTAA 1050 

15 CGTTCAACAA ACCTTGCAAG CTTACAAGTC TTTGCAAGAT ATCATTGCCA 1100 

TTTTGGGTAT GGACGAATTG TCGGAAGCTG ACAAGTTGAC CGTCGAGAGA 1150 

GCCCGTAAGA TCGAGAGATT CTTGTCTCAA CCATTTGCTG TTGCCGAAGT 1200 

TTTCACTGGT ATCAGTGGTA AGTTGGTCAG ATTGGAGGAC ACTATCAGAT 1250 

CTTTCAAGGA AGTCTTGGAA GGTAAGTACG ATCACTTGCC AGA 12 93 

20 



2) INFORMATION FOR SEQ ID NO: 465 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 93 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida haemulonii 
35 (B) STRAIN: ATCC 22991 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465 

CACTTTGACG ATGGTAACTT . GCCAGCCATT TTCAACGCCT TGAAGTTGAA 50 

40 GAACGGTGAC CAGGACTTGG TCTTGGAGGT CGCCCAGCAC TTGGGTGAGA 100 

ACACCGTCAG AACCATTGCC ATGGACGGTA CCGATGGTTT GGTCAGAGGC 150 

GAGGCTGTCA CTGACACTGG TGCTCCAATC TCCGTGCCTG TTGGTCGTGA 2 00 

GACTTTGGGT CGTATCATCA ACGTTATTGG TGAGCCAATT GACGAGAGAG 250 

GACCAATCAA GTCCAAGAAG AGAAACCCAA TCCACACTGA CCCACCAACC 300 

45 TTCGTTGAGC AGTCTACTTC TGCTGAGGTT TTGGAGACTG GTATTAAGGT 3 50 

TGTCGACTTG TTGGCCCCTT ACGCCAGAGG TGGTAAGATT GGTTTGTTCG 400 

GTGGTGCCGG TGTCGGTAAG ACCGTCTTTA TCCAAGAGTT GATTAACAAC 450 

ATTGCCAAGG CCCACGGTGG TTTCTCTGTC TTTACCGGTG TCGGTGAGAG 500 

AACCAGAGAA GGTAACGATT TGTACCGTGA AATGCAGGAG ACCGGTGTCA 550 

50 TCAACTTCGA GGGTGACTCC AAGGTCGCCT TGGTCTTCGG TCAGATGAAC 600 

GAGCCACCAG GAGCTAGAGC TAGAGTTGCT TTGACCGGTT TGACCATTGC 650 

CGAATACTTC AGAGATGAAG AAGGTCAGGA TGTGTTGTTG TTCGTTGACA 700 

ACATTTTCAG ATTCACTCAG GCTGGTTCCG AGGTGTCCGC CTTGTTGGGT 750 

CGTATTCCAT CTGCTGTCGG TTACCAGCCA ACCTTGGCCA CCGATATGGG 8 00 

55 TACCTTGCAA GAAAGAATTA CCACCACCAA GAAGGGTTCC GTCACCTCTG 850 

TCCAGGCCGT TTACGTGCCA GCTGATGATT TGACCGATCC TGCCCCAGCT 900 

ACCACTTTCG CTCACTTGGA TGCTACCACT GTGTTGTCTA GAGGTATTTC 950 

CGAGTTGGGT ATCTACCCAG CTGTCGACCC ATTGGACTCC AAGTCTAGAT 1000 

TGTTGGATGC CTCTGTTGTC GGTAAGGAGC ACTACGATGT TGCTACCAAC 1050 

60 GTCCAGCAGA CCTTGCAGGC CTACAAGTCC TTGCAGGATA TCATTGCCAT 1100 

284 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

TTTGGGTATG GATGAATTGT CCGAAGCCGA 
CTAGAAAGAT TCAGAGATTC TTGTCCCAGC 
TTCACTGGTA TCGAGGGTAA GTTGGTCAGA 
TTTCAAGGAG GTCTTGGATG GTAAGTACGA 



PCT/CA00/01150 

CAAGTTGACT GTCGAGAGAG 115CT 

CATTCGCCGT CGCCGAGGTT 1200 

TTGGAGGAGA CCGTCAAGTC 1250 

CCACTTGCCA GAG 12 93 



2) INFORMATION FOR SEQ ID NO: 466 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1111 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida inconspicua 
20 (B) STRAIN: ATCC 16783 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466 

AACGGTGGAT CTAAGTTAGT CTTAGAAGTT GCTCAACATT TGGGTGAAAA 50 

25 CACTGTCAGA ACCATTGCTA TGGATGGTAC TGAAGGTTTA GTTAGAGGTC 100 

AACCAGTTAA TGATACTGGT GCTCCAATCT CTGTCCCAGT CGGTAGAGGT 150 

ACTTTAGGTA GAATC TTAAA CGTTATTGGT GATCCAGTCG ATGAAAGAGG 2 00 

TCCAATTGAT TGTAAGGAAA GAAAACCAAT TC AT CAAGAT CCTCCTGCTT 250 

TCGTTGAACA ATCTACTGAA GCTGAAGTTT TAGAAACCGG TATTAAGGTT 300 

30 GTTGATTTAT TAGCTCCTTA CGCTAGAGGT GGTAAGATTG GTTTATTCGG 3 50 

TGGTGCTGGT GTCGGTAAAA CCGTTTTCAT TCAAGAATTA ATTAACAATG 400 

TTGCAAAGGC TCATGGTGGT TTCTCAGTTT TCACTGGTGT CGGTGAAAGA 4 50 

ACCAGAGAAG GTAATGATTT ATACAGAGAA ATGAAGGAAA CTGGTGTTAT 500 

TAACTTAGAA GGTGAATCTA AGGTCGCCTT AGTTTTCGGT CAAATGAATG 550 

35 AACCACCAGG AGCAAGAGCA AGAGTTGCTT TAACCGGTTT AACTATTGCT 600 

GAATACTTCA GAGATGAAGA AGGTCAAGAT GTCTTGTTAT TCATTGATAA 650 

CATTTTCAGA TTTACTCAAG CAGGTTCTGA AGTTTCTGCA TTGTTAGGTA 7 00 

GAATTCCATC CGCTGTCGGT TATCAACCAA CTTTAGCAAC TGATATGGGT 750 

CTTTTACAAG AAAGAATTAC AACTACTAAG AAAGGTTCCG TTACTTCTGT 8 00 

4 0 CCAAGCAGTT TATGTCCCAG CAGATGATTT AACTGATCCT GCTCCAGCAA 850 

CTACTTTCGC CCACTTAGAT GCAACTACTG TCTTATCTAG AGGTATTTCC 900 

GAATTAGGTA TTTACCCAGC TGTCGATCCA TTAGATTCTA AATCTAGATT 950 

ATTAGATGTT GCTGTTGTTG GTCAAGAACA TTATGATGTT GCAACTCAAG 1000 

TTCAAGAAAC TTTACAAGCA TACAAATCTT TACAAGATAT TATTGCTATT 1050 

45 TTAGGTATGG ATGAATTATC TGAACAAGAT AAATTAACCG TTGAAAGAGC 1100 

AAGAAAGATT C 1111 



50 2) INFORMATION FOR SEQ ID NO: 467 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 83 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

285 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

(A) ORGANISM: Candida kefyr 

(B) STRAIN: ATCC 28838 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467 

5 

TCGAACAAGG TCAATTGCCA GAAATTTTGA ACGCTTTGGA GATTGAWACT 50 

CCTCAAGGTA AGTTGGTTTT GGAAGTTGCC CAACATTTGG GTGAAAACAC 100 

CGTCAGAACC ATTGCTATGG ACGGTACCGA AGGTTTGGTC CGTGGTGAGA 150 

AGGTTTTGGA CACTGGTGCT CCAATTTCCG TCCCAGTCGG TAGAGAAACT 2 00 

10 TTGGGTAGAA TCATCAACGT TATTGGTGAG CCAATTGACG AAAGAGGCCC 250 

AATCAAGTCC AAGATGAGAA AGCCAATTCA CGCTGACCCT CCATCCTTTG 300 

TTGAACAATC CACTGCTGCT GAAGTTTTGG AAACCGGTAT CAAGGTTGTC 350 

GACTTGTTGG CCCCATACGC CAGAGGTGGT AAGATTGGTT TGTTCGGTGG 400 

TGCCGGTGTC GGTAAGACCG TTTTCATCCA AGAGTTGATT AACAACATTG 450 

15 CCAAGGCCCA TGGTGGTTTC TCCGTCTTCA CCGGTGTCGG TGAAAGAACC 500 

AGAGAAGGTA ACGATTTGTA CCGTGAAATG AAGGAAACCG GTGTCATCAA 550 

CTTGGAAGGT GACTCCAAGG TCGCCTTGGT CTTCGGTCAA ATGAACGAAC 600 

CACCTGGAGC TAGAGCCAGA GTTGCCTTGA CCGGTTTGAC TATCGCTGAA 650 

TACTTCAGAG ATGAAGAAGG TCAAGATGTG TTGTTGTTTA TCGACAACAT 700 

20 TTTCAGATTC ACACAAGCCG GTTCCGAAGT GTCCGCTTTG TTGGGTCGTA 750 

TTCCATCCGC TGTCGGTTAC CAACCTACTT TGGCCACCGA TATGGGTTTG 800 

TTGCAAGAAA GAATTACCAC TACCAAGAAG GGTTCCGTTA CCTCCGTCCA 850 

AGCTGTCTAC GTCCCTGCTG ATGATTTGAC TGATCCTGCT CCAGCTACYA 900 

CTTTCGCCCA TTTGGACGCC ACCACCGTGT TGTCCAGAGG TATCTCCGAA 950 

25 TTGGGTATCT ACCCAGCTGT CGATCCATTG GATTCCAAGT CTAGATTGTT 1000 

GGACGCTGCC GTTGTCGGTC AAGAACATTA CGACGTCGCT ACTCAAGTTC 1050 

AACAAACTTT GCAAGCTTAC AAGTCTTTGC AAGATATCAT TGCCATTTTG 1100 

GGTATGGATG AATTGTCTGA ACAAGACAAG TTGACTGTCG AAAGAGCCAG 1150 

AAAGATCCAA AGATTCTTGT CTCAACCATT CGCCGTCGCC GAAGTTTTCA 12 00 

30 CTGGTATCCC AGGTAGATTG GTCAGATTAA AGGACACCAT CGCTTCCTTC 12 50 

AAGGCTGTTT TGGAAGGTAA GTACGATCAC TTG 1283 



35 2) INFORMATION FOR SEQ ID NO: 468 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1287 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida krusei 

(B) STRAIN: ATCC 34135 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468 



CTTCGAACAA GGCCAATTAC CACAAATTTT AAACGCTTTA GTTATGGATA 50 

ATGGTGGTAA CAAGTTAGTT TTAGAAGTTG CTCAACATTT AGGTGAAAAC 100 

ACTGTCAGAA CCATTGCTAT GGATGGTACT GAAGGTTTAG TTAGAGGTCA 150 

AACCGTTAAC GATACCGGTG CTCCAATCTC TGTCCCAGTT GGTAGAGGTA 2 00 

55 CCTTAGGTAG AATCTTGAAC GTCATTGGTG ATCCAGTCGA TGAAAGAGGT 250 

CCAGTTGACT GTAAGGAAAG AAAGCCAATT CACGCTGATC CTCCAGCTTT 300 

CGTTGAACAA TCCACTGAAG CTGAAGTTTT GGAAACTGGT ATTAAGGTTG 350 

TCGATTTATT AGCACCTTAC GCAAGAGGTG GTAAGATTGG TTTATTCGGT 4 00 

GGTGCTGGTG TTGGTAAGAC CGTTTTTATC CAAGAATTGA TCAACAATGT 450 

60 YGCAAAGGCT CATGGTGGTT TCTCCGTTTT CACTGGTGTT GGTGAAAGAA 500 

286 
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CCAGAGAAGG TAACGATTTA TACAGAGAAA TGAAGGAAAC TGGTGTTATT 550 

AACTTGGAAG GTGAATCCAA GGTCGCCTTA GTTTTCGGTC AAATGAACGA 600 

ACCACCAGGA GCTAGAGCAA GAGTTGCTTT AACTGGTTTG ACCATTGCAG 650 

AATATTTCAG AGATGAAGAA GGTCAAGATG TCTTGTTATT CATTGATAAC 700 

5 ATTTTCAGAT TCACCCAAGC AGGTTCTGAA GTCTCTGCAT TATTAGGTAG 750 

AATTCCATCT GCTGTCGGTT ATCAACCAAC TTTAGCAACC GATATGGGTC 800 

TTTTACAAGA AAGAATTACC ACCACCAAGA AGGGTTCCGT TACTTCTGTC 850 

CAAGCTGTTT ATGTCCCAGC CGATGATTTA ACCGATCCTG CTCCAGCTAC 900 

TACTTTCGCC CACTTGGATG CAACCACTGT CTTGTCYAGA GGTATTTCCG 950 

10 AATTAGGTAT CTACCCAGCT GTCGATCCAT TAGATTCTAA GTCTAGATTA 1000 

TTAGATGTTG CAGTTGTTGG TCAAGAACAT TATGAAGTTG CAACTCAAGT 1050 

CCAAGAAACT TTACAAGCTT ACAAGTCTTT ACAAGATATT ATTGCTATTT 1100 

TGGGTATGGA TGAATTATCT GAACAAGATA AGTTAACYGT TGAAAGAGCA 1150 

AGAAAGATCC AAAGATTCTT ATCTCAACCA TTCTCTGTTG CAGAAGTTTT 1200 

15 CACTGGTATT CCAGGTAAGT TAGTCAGATT AGAAGAAACC ATCAAGTCTT 1250 

TCAGGGATGT TCTTGCAGGT AAGTACGATC ACTTACC 1287 



20 2) INFORMATION FOR SEQ ID NO: 469 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1283 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida lawbica 

(B) STRAIN: ATCC 24750 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 469 



TCGAACAAGG TCAATTACCA CCAATTCTTA ACGCTTTGGT CATGGAAAAC 50 

GATGGTCAAA AGTTAGTTTT GGAAGTTGCT CAACATTTGG GTGAAAACAC 100 

CGTCAGAACC ATTGCTATGG ACGGTACCGA AGGTTTAGTT AGAGGTCAAC 150 

CTGTTAACGA CACTGGTGCT CCAATCTCTG TCCCAGTTGG TAGAGGTACT 200 

40 CTTGGTAGAA TCTTGAACGT CACTGGTGAC CCAGTYGATG AAAGAGGTCC 250 

TGTCGAMTGT AAGGAGAGAA GACCAATTCA CCAAGACCCA CCTGCTTTCG 3 00 

TTGACCAATC CACTGAAGCT GAAGTTTTGG AAACCGGTAT TAAGGTTGTC 350 

GATTTATTAG CACCTTACGC TAGAGGTGGT AAGATTGGTT TGTTCGGTGG 400 

TGCTGGTGTC GGTAAGACCG TTTTCATCCA AGAATTGATT AACAATGTTG 450 

45 CTAAGGCCCA CGGTGGTTTC TCCGTTTTCA CTGGTGTCGG TGAAAGAACC 500 

AGAGAAGGTA ACGATTTATA CAGAGAAATG AAGGAAACCG GTGTTATTAA 550 

CCTTGAAGGT GAATCTAAGG TCGCCCTCGT TTTCGGTCAA ATGAACGAAC 600 

CACCAGGAGC AAGAGCTAGA GTTGCTCTTA CTGGTTTGAC CATTGCTGAA 650 

TACTTCAGAG ATGAAGAAGG TCAAGATGTC TTGCTTTTCA TTGACAACAT 700 

50 TTTCAGATTC ACCCAAGCAG GTTCCGAAGT GTCTGCTTTG TTAGGTAGAA 750 

TTCCWTCTGC CGTTGGTTAC CAACCAACCT TAGCAACCGA TATGGGTTTG 800 

TTACAAGAAA GAATTACCAC CACCAAGAAG GGTTCCGTCA CCTCCGTCCA 850 

AGCTGTTTAC GTCCCAGCTG ATGATTTRAC TGACCCTGCA CCAGCAACCA 900 

CTTTCGCCCA CTTGGATGCT ACCACCGTCT TGTCTAGAGG TATTTCCGAA 950 

55 TTAGGTATTT ACCCAGCTGT CGATCCATTA GATTCTAAGT CTAGATTATT 1000 

GGATGTCGCT GTTGTCGGTC AAGAACATTA TACCGTTGCA ACYCAAGTCC 1050 

AAGAAACTTT ACAAGC TTAC AAGTCCTTAC AAGATAT CAT TGCTATTTTG 1100 

GGTATGGACG AATTATCTGA ACAAGATAAG CTTACTGTTG AAAGAGCAAG 1150 

AAAGATCCAA AGATTCCTTT CCCAACCTTT CTCCGTCGCA GAAGTTTTCA 1200 

60 CTGGTATCCC AGGTAAGCTT GTCAGATTAG AAGAAACCAT TATTTCTTTC 1250 
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AGAGATGTCC TCGATGGTAA GTACGACCAC TTA 



PCT/CA00/01150 
1283 



2) INFORMATION FOR SEQ ID NO: 470 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1140 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida lusitaniae 

(B) STRAIN: ATCC 66035 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470 

AGGTGCCTCT GTCACTGACA CTGGTTCTCC AATCTCTGTC CCTGTTGGTC 50 

GTGAAACCTT GGGTAGAATT ATCAACGTTG TTGGTGAGCC AATTGACGAG 100 

AGAGGCCCAA TCAACTCCAA GAAGAGAAAC CCAATTCACA CTGAGCCACC 150 

ATCGTTTGTT GAACAATCCA CTTCTGCTGA AGTTTTGGAG AC TGGT ATC A 2 00 

AGGTTGTCGA CTTGTTGGCC CCTTACGCCA GAGGTGGTAA GATTGGTTTG 250 

TTCGGTGGTG CCGGTGTCGG TAAGACCGTT TTCATCCAAG AGTTGATTAA 3 00 

CAACATTGCC AAGGCCCACG GTGGTTTCTC TGTTTTCACT GGTGTCGGTG 3 50 

AAAGAACCAG AGAAGGTAAC GATTTGTACC GTGAAATGCA AGAGACCGGT 400 

GTCATCAACT TCGAGGGTGA CTCCAAGGTC GCCTTGGTCT TCGGTCAAAT 450 

GAACGAACCA CCAGGAGCCA GAGCTAGAGT TGCTTTGACC GGTTTGACTA 500 

TTGCCGAGTA CTTCAGAGAC GAAGAGGGCC AAGATGTCTT GTTGTTCGTT 550 

GACAACATTT TCAGATTCAC CCAGGCCGGT TCTGAAGTGT CTGCTTTGTT 600 

GGGTCGTATT CCATCCGCTG TCGGTTACCA ACCAACCTTG GCCACCGATA 650 

TGGGTGCTTT GCAAGAGAGA ATTACCACCA CCAAGAAGGG TTCCGTCACC 700 

TCTGTCCAAG CCGTTTATGT TCCAGCTGAT GACTTGACTG ACCCTGCTCC 750 

AGCCACCACC TTCGCCCACT TGGACGCCAC CACTGTGTTG TCCAGAGGTA 800 

TCTCTGAATT GGGTATCTAC CCAGCTGTCG ACCCATTGGA CTCCAAGTCT 850 

AGATTGTTGG ACGCTTCTAT TGTTGGTAAG GAGCACTACG AAGTTGCTTC 90 0 

TAACGTTCAA CAAACTTTGC AAGCTTACAA GTCTTTGCAA GATATCATTG 950 

CCATTTTGGG TATGGATGAA TTGTCTGAGG CTGACAAGTT GACCGTTGAG 100 0 

AGAGCCAGAA AGATC CAAAG ATTCTTGTCT CAACCATTCG CTGTTGCCGA 1050 

GGTTTTCACT GGTATCCCAG GTAGATTGGT CAGATTGGAG GACACTGTCA 110 0 

GATCCTTCAA GGAAGTTTTG GACGGTAAGT ACGACCACTT 114 0 



2) INFORMATION FOR SEQ ID NO: 471 

( i ) SEQUENCE CHARACTER I STI CS : 

(A) LENGTH: 12 96 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida norvegensis 

(B) STRAIN: ATCC 22977 

. 288 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471 



PCT/CA00/01150 



TCAATTCGAA CAAGGTCAAT TACCACCAAT TTTAAACGCT TTAGTCATGG 50 

ACAACGGTGG AACTAAGTTA GTTTTAGAAG TTGCTCAACA TTTAGGTGAA 100 

5 AACACCGTCA GAACCATTGC TATGGATGGT ACTGAAGGTT TAGTTAGAGG 150 

TCAACCAGTT AATGATACTG GTGCTCCAAT CTCTGTCCCA GTCGGTAGAG 200 

GTACTTTAGG TAGAATCTTA AACGTTATTG GTGATCCAGT CGATGAAAGA 250 

GGTCCAATCG ATTGTAAGGA AAGAAAACCA ATTCATCAAG ATCCTCCTGC 300 

TTTCGTTGAA CAATCAACTG AAGCTGAAGT TTTAGAAACT GGTATTAAAG 350 

10 TTGTCGATTT ATTAGCTCCT TACGCTAGAG GTGGTAAGAT TGGTTTATTC 400 

GGTGGTGCAG GTGTTGGTAA AACCGTTTTT ATTCAAGAAT TAATTAACAA 450 

TGTTGCAAAA GCTCATGGTG GTTTCTCCGT TTTCACTGGT GTCGGTGAAA 500 

GAACTAGAGA AGGTAACGAT TTATACAGAG AAATGAAGGA AACTGGTGTT 550 

ATTAATTTAG AAGGTGATTC TAAGGTCGCA TTAGTTTTCG GTCAAATGAA 600 

15 CGAACCACCT GGAGCAAGAG CAAGAGTTGC TTTAACTGGT TTAACTATTG 650 

CTGAATATTT CAGAGATGAA GAAGGTCAAG ATGTCTTGTT ATTCATTGAT 700 

AACATTTTCA GATTTACTCA AGCAGGTTCT GAAGTTTCCG CTTTGTTAGG 750 

TAGAATTCCA TCCGCTGTCG GTTATCAACC AACTTTAGCA ACTGATATGG 800 

GTCTTTTACA AGAAAGAATT ACTACTACTA AGAAAGGTTC TGTTACTTCC 850 

20 GTTCAAGCAG TTTATGTCCC AGCAGATGAT TTAACTGATC CTGCTCCAGC 900 

AACTACTTTC GCCCACTTAG ATGCAACTAC TGTCTTATCT AGAGGTATTT 950 

CCGAATTAGG TATTTACCCA GCTGTCGATC CATTAGATTC TAAATCTAGA 1000 

TTATTAGATG TTGCAGTTGT TGGTCAAGAA CATTATGATG TTGCAACTCA 1050 

AGTTCAAGAA ACTTTACAAG CTTACAAATC TTTACAAGAT ATTATTGCTA 1100 

25 TTTTAGGTAT GGATGAATTA TCTGAACAAG ATAAATTAAC CGTTGAAAGA 1150 

GCTAGAAAAA TTCAAAGATT CTTATCTCAA CCTTTCTCTG TCGCAGAAGT 1200 

TTTCACTGGT ATCCCAGGTA AGCTTGTTAG ATTAGAAGAA ACTATTTCTT 1250 

CATTCAGAGA TGTCTTAGCA GGTAAGTACG ATCACTTACC AGAAAA 1296 



30 



45 



2) INFORMATION FOR SEQ ID NO: 472 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 1297 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida parapsilosis 

(B) STRAIN: ATCC 90018 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472 



ACACTTCGAA GAAGGAAACT TGCCAGCTAT TTTGAACGCT TTGACCTTGA 50 

AAAACGGTAA ACAAGACTTG GTCTTGGAAG TTGCCCAACA TTTGGGTGAA 100 

50 AACACCGTCA GAGCAATTGC TATGGATGGT ACTGAAGGTT TAGTTAGAGG 150 

TACTTCTGTC ACTGACACTG GTGCCCCAAT TTCTGTCCCA GTTGGTAGAG 200 

GTACTTTGGG TAGAATCATC AATGTTACTG GTGACCCAAT TGATGAAAGA 250 

GGTCCAATTG AATGTAAGAA GAGAAACCCA ATCCACGCTG AACCACCTTC 300 

ATTCATTGAA CAATCCACTG CTGCTGAAGT TTTGGAAACC GGTATTAAGG 3 50 

55 TTGTCGACTT GTTGGCTCCA TACGCTAGAG GTGGTAAGAT TGGTTTGTTC 40 0 

GGTGGTGCCG GTGTCGGTAA AACCGTGTTT ATCCAAGAAT TGATCAACAA 450 

CATTGCCAAG GCACATGGTG GTTTCTCGGT TTTCACTGGT GTTGGTGAAA 500 

GAACCAGAGA AGGTAACGAT TTGTACCGTG AAATGAAGGA AACTGGTGTT 550 

ATCAACTTGG AAGGTGACTC CAAGGTCGCT TTGGTTTTCG GTCAAATGAA 600 

60 CGAACCACCT GGAGCTAGAG CTAGAGTTGC TTTGACTGGT TTGACCATTG 650 
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CTGAATACTT TAGAGATGAA GAAGGTCAAG ATGTCTTGTT GTTTATTGAC ' 700* 

AACATTTTCA GATTCACCCA AGCTGGTTCA GAAGTGTCTG CTTTGTTGGG 750 

TCGTATTCCA TCCGCTGTCG GTTATCAACC AACCTTGGCC ACCGATATGG 800 

GTCTTTTGCA AGAACGTATT ACCACTACCA AAAAGGGTTC AGTTACTTCA 850 

5 GTCCAAGCCG TTTACGTGCC AGCTGATGAT TTAACCGATC CTGCTCCAGC 900 

TACCACTTTC GCTCACTTGG ATGCCACCAC TGTGTTGTCT AGAGGTATTT 950 

CGGAGTTGGG TATTTACCCA GCTGTCGATC CATTGGATTC CAAATCCAGA 1000 

TTGTTGGATG CTGCCGTTGT TGGTCAAGAA CATTACGACG TTGCCACTGG 1050 

TGTTCAACAA ACTTTACAAG CTTACAAGTC CTTGCAAGAT ATCATTGCTA 1100 

10 TCTTGGGTAT GGATGAATTG TCCGAACAAG ATAAATTGAC TGTCGAAAGA 1150 

GCTCGTAAGA TTCAAAGATT CTTGTCTCAA CCATTTGCTG TCGCTGAAGT 1200 

TTTCACTGGT ATTCCAGGTA AATTGGTTAG ATTGTCTGAA ACTGTCAAGT 1250 

CATTCAAGGA AGTCTTGGAA GGTAAGTACG ATAACTTGCC AGAAAAT 12 97 



15 



30 



60 



2) INFORMATION FOR SEQ ID NO: 473 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1285 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida rugosa 

(B) STRAIN: ATCC 96275 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 473 



CCAGTTTGGC AACGACCTCC CTGCCATTTT GAACGCCCTC ACTCTTCAGC 50 

GTGAAGACGG TAACAAGCTT GTTCTTGAGG TTGCCCAGCA TCTCGGTGAG 100 

35 AACACCGTCC GTACCATTGC TATGGACGGT ACCGAGGGTT TGGTGCGTGG 150 

CACTGGTGTC CACGACACCG GACACCCCAT CATGACTCCC GTCGGTGACG 200 

GTACCCTGGG ACGTATTCTT AACGTCACCG GTGACCCTGT AGACGAGCGT 250 

GGTCCCGTCA AGACTGACAA GTTCCGCCCC ATCCACGCCG AGGCCCCTGC 3 00 

CTTCGATGAG CAGGCTACCA GTGCCGAGGT TCTTGAGACC GGTATCAAGG 3 50 

4 0 TTGTCGACTT GCTCGCTCCT TACGCCAAGG GTGGTAAGAT TGGTTTGTTC 400 

GGTGGTGCCG GTGTCGGCAA GACCGTCTTC ATCCAGGAGC TGATTAACAA 450 

CATCGCCAAG GCCCACGGTG GTTACTCCGT GTTCACTGGT GTCGGTGAGC 500 

GTACTCGTGA GGGTAACGAT TTGTACAAGG AAATGATCGA GTCCGGTGTC 550 

ATCAACCTCG ATGGTGAGTC CAAGGTCGCC TTGGTGTTCG GTCAGATGAA 600 

45 CGAGCCCCCT GGAGCCCGTG CCCGTGTCGC CCTTACCGGT CTTACCATCG 650 
CTGAGTATTT CCGTGATGAG GAGGGTAAGG ATGTCTTGTT GTTCGTTGAC - 700 

AACATTTTCC GCTTCACTCA GGCCGGTTCT GAGGTGTCCG CCTTGCTTGG 750 

TCGTATTCCT TCCGCTGTCG GTTACCAGCC TACCCTGGCC ACCGATATGG 800 

GTGCCCTTCA GGAGCGTATC ACCACCACCA AGAAGGGTTC CGTTACATCC 850 

50 GTCCAGGCCG TCTACGTCCC TGCCGATGAT TTGACTGATC CCGCCCCTGC 900 

CACCACCTTC GCCCATTTGG ATGCCACCAC TGTCTTGTCT CGTGCCATCT 950 

CTGAGTTGGG TATCTACCCC GCTGTCGACC CTCTCGACTC CAAGTCCCGT 1000 

CTTCTTGACG CCGCTGTCGT TGGTCAGGAG CACTACGATA CTGCCACCTC 1050 

CGTTCAGCAG ACTTTGCAGG CTTACAAGTC TTTGCAGGAT ATCATTGCCA 1100 

55 TTCTTGGTAT GGATGAGTTG TCCGAGTCTG ACAAGCTCAC CGTCGAGCGT 1150 

GCTCGTAAGA TCCAGCGTTT CCTCTCCCAG CCTTTCGCTG TTGCTGAGGT 1200 

CTTCACTGGT ATTCAGGGCC GTCTTGTTCC TCTCAAGGAC ACTGTCCGCT 1250 

CCTTCAAGGA GATTCTCGAA GGTAAGTACG ATGCT 1285 
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2 ) INFORMATION FOR SEQ ID NO: 474 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1283 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida sphaerica 

(B) STRAIN: ATCC 2504 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 474 

TTGAACAAGG TCAATTGCCA GCTATTTTGA ACGCTTTGGA AATCGACACT 50 

CCAGAAGGAA AGTTGGTTTT GGAAGTCGCT CAACATTTGG GTGAAAACAC 100 

TGTCAGAACC ATTGCTATGG ATGGTACTGA AGGTTTAGTC CGTGGTGAAA 150 

ACGTTTCTGA CACTGGTGCT CCAATTTCCG TCCCAGTTGG TAGAGAAACC 200 

TTGGGTAGAA TTATCAACGT TATTGGTGAG CCAATTGACG AAAGAGGTCC 250 

AATCAACTCC AAGATGAGAA AGCCAATTCA TGCTGATCCT CCATTATTCG 3 00 

TTGAACAATC CACTGCTGCT GAAGTTTTGG AAACTGGTAT CAAGGTTGTC 350 

GACTTGTTGG CCCCATACGC CAGAGGTGGT AAGATTGGTT TGTTCGGTGG 400 

TGCCGGTGTC GGTAAGACCG TTTTCATCCA AGAATTGATT AACAACATTG 450 

CCAAGGCTCA TGGTGGTTTC TCTGTCTTCA CTGGTGTCGG TGAAAGAACC 500 

AGAGAAGGTA ACGATTTGTA CCGTGAAATG AAGGAAACTG GTGTTATCAA 550 

CTTGGAAGGT GATTCTAAGG TCGCGTTGGT TTTCGGTCAA ATGAACGAAC 600 

CTCCTGGAGC TAGAGCTAGA GTCGCCTTGA CTGGTTTGAC CATCGCTGAA 650 

TACTTCAGAG ATGAAGAAGG TCAAGATGTC TTGTTGTTTA TCGACAACAT 700 

TTTCAGATTC ACACAAGCCG GTTCCGAAGT GTCCGCTTTG TTGGGTCGTA 750 

TTCCATCCGC TGTCGGTTAT CAACCAACTT TGGCCACCGA TATGGGTTTG 800 

TTGCAAGAAA GAATTACTAC CACCAAGAAG GGTTCCGTCA CTTCTGTCCA 850 

AGCCGTTTAC GTGCCAGCTG ATGATTTGAC TGATCCTGCT CCAGCTACCA 900 

CTTTCGCGCA TTTGGATGCC ACCACTGTGT TGTCCAGAGG TATCTCTGAA 950 

TTGGGTATCT ACCCAGCTGT CGATCCTTTG GATTCCAAAT CTAGATTGTT 1000 

GGATGCTGCC GTCGTTGGTC AAGAACATTA CGATGTCGCT ACTCAAGTTC 1050 

AACAAACTTT GCAAGCTTAC AAGTCTTTGC AAGATATCAT TGCCATTTTG 1100 

GGTATGGATG AATTGTCCGA ACAAGATAAG TTGACCGTCG AAAGAGCTAG 1150 

AAAGATTCAA AGATTCTTGT CTCAACCTTT CGCTGTCGCT GAAGTCTTCA 12 00 

CTGGTATCCC AGGTAGATTG GTCAGATTAA AGGACACCAT CTCTTCTTTC 1250 

AAGGCTGTCT TGGACGGTAA GTACGATCAC TTG 12 83 



2) INFORMATION FOR SEQ ID NO: 475 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 90 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida tropicalis 

(B) STRAIN: ATCC 13803 

291 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 475' 

CGACGAAGGT AACTTGCCAG CTATTTTGAA TGCTTTGACT TTGAAGAACG 50 

GTGACCAAGA CTTGGTTTTG GAAGTTGCTC AACATTTGGG TGAAAACACC 100 

5 GTCAGGGCTA TTGCTATGGA TGGTACCGAA GGTTTAGTCA GAGGTACTGC 150 

TGTCACTGAT ACCGGTGCTC CAATTTCTGT CCCAGTTGGT AGAGGTACCT 200 

TGGGTAGAAT TATCAACGTT GTTGGTGAAC CAATTGATGA CAGAGGTCCA 250 

ATTGAATGTA AGGAAAGAAA GCCAATTCAC GCTGAACCAC CTTCATTCGT 300 

TGAACAATCT ACTGCTGCCG AAATTTTGGA AACCGGTATT AAGGTTGTCG 350 

10 ATTTGTTGGC TCCTTACGCT AGAGGTGGTA AGATTGGTTT GTTCGGTGGT 4 00 

GCTGGTGTCG GTAAAACCGT CTTCATCCAA GAATTGATTA ACAACATTGC 450 

CAAAGCTCAT GGTGGTTTCT CTGTCTTTAC CGGTGTCGGT GAAAGAACCA 500 

GAGAAGGTAA CGATTTGTAC CGTGAAATGA AAGAAACTGG TGTCATCAAC 550 

TTGGAAGGTG ACTCCAAGGT CGCCTTGGTC TTCGGTCAAA TGAACGAACC 600 

15 ACCTGGTGCT AGAGCTAGAG TTGCTTTGAC TGGTTTGACC ATTGCTGAAT 650 

ACTTCAGAGA TGAAGAAGGT CAAGATGTCT TGTTGTTCAT TGACAACATT 700 

TTCAGATTCA CCCAAGCTGG TTCCGAAGTG TCTGCTTTGT TGGGTCGTAT 750 

TCCATCTGCT GTCGGTTATC AACCAACCTT GGCCACTGAT ATGGGTCTTT 800 

TGCAAGAACG TATTACCACC ACCAAGAAAG GTTCTGTTAC TTCTGTCCAA 850 

20 GCTGTTTATG TCCCAGCCGA TGATTTAACC GATCCAGCTC CAGCTACCAC 900 

CTTCGCTCAC TTGGATGCCA CTACTGTCTT GTCTAGAGGT ATTTCTGAAT 950 

TGGGTATTTA CCCAGCTGTC GATCCATTGG ATTCTAAATC CAGATTATTG 1000 

GATGCTACTG TTGTTGGTCA AGAACATTAT GATGTTGCCA CTGGTGTTCA 1050 

ACAAACTTTA CAAGCTTACA AGTCCTTGCA AGATATCATT GCTATTTTGG 1100 

25 GTATGGATGA ATTGTCCGAA GC CGATAAAT TGACTGTCGA AAGAGCTCGT 1150 

AAGATTCAAA GATTCTTGTC CCAACCATTC GCTGTTGCCG AAGTTTTCAC 12 00 

TGGTATCCCA GGTAGATTGG TTAGATTACA AGATACTGTT AAATCTTTCA 1250 

AAGATGTCTT GGAAGGTAAA TACGATCACT TGCCAGAAAA 12 90 



30 



45 



2 ) INFORMATION FOR SEQ ID NO: 476 



(i) SEQUENCE CHARACTERISTICS : 
35 (A) LENGTH: 1267 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida utilia 

(B) STRAIN: Csp 388 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 476 



TTTGAACGCC CTTGAGATCA AGAACCACGA GTCTGGAAGA TTGGTCCTCG 50 

AGGTTGCCCA GCACTTGGGT GAGAACACTG TCAGAACTAT TGCCATGGAT 100 

50 GGTACCGAAG GTCTCGTCCG TGGTGAGTCT GTTGTTGACA CCGGTTCTCC 150 

AATTACTGTC CCAGTTGGTC GTGAGACCTT GGGTCGTATC ATCAACGTTA 2 00 

TTGGTGAGCC AATTGATGAG CGTGGTCCAA TCAACAC CAA GCACAGAAAC 250 

CCAATTCACG CTGAGCCACC ATCCTTCGCT GAGCAGTCTA CTGCTGCTGA 3 00 

GGTTTTGGAG ACTGGTATCA AGGTTGTCGA CCTTCTCGCC CCATACGCCA 3 50 

55 GAGGTGGTAA GATTGGTCTC TTTGGTGGTG CCGGTGTCGG TAAGACCGTT 400 

TTCATCCAGG AGTTGATTAA CAACATTGCC AAGGCCCACG GTGGTTTCTC 450 

TGTTTTCACC GGTGTCGGTG AGAGAACCAG AGAGGGTAAC GATTTGTACC 500 

GTGAGATGAA GGAGACTGGT GTCATCAATC TCGAGGGTGA GTCCAAGGTT 550 

GCTCTCGTCT TCGGTCAGAT GAACGAGCCA CCAGGTGCCC GTGCCCGTGT 600 

60 CGCTTTGACC GGTTTGACCA TTGCTGAGTA CTTCAGAGAT GAGGAGGGTC 650 
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AGGATGTCTT GTTGTTTATT GACAACATTT TCAGATTCAC CCAGGCCGGT" 700 

TCCGAGGTGT CTGCCTTGCT TGGTCGTATT CCATCCGCTG TCGGTTACCA 750 

GCCAACTTTG GCCACCGATA TGGGTTTGTT GCAGGAGAGA ATTACCACCA 800 

CCCAGAAGGG TTCCGTCACT TCTGTCCAGG CCGTTTACGT CCCAGCTGAT 850 

5 GATTTGACTG ATCCTGCTCC AGCCACCACT TTCGCCCACT TGGACGCCAC 900 

CACTGTGTTG TCCCGTGGTA TCTCTGAGTT GGGTATTTAC CCAGCTGTCG 950 

ACCCATTGGA CTCCAAGTCC AGATTGTTGG ACGCTGCCGT TGTTGGTGAC 1000 

GAGCACTACA ACACCGCCAC CGATGTCCAG CAGACCCTTC AGGCTTACAA 1050 

GTCTCTCCAG GATATCATTG CTATTTTGGG TATGGATGAG TTGTCTGAGG 1100 

10 CTGACAAGTT GACTGTCGAG AGAGCCAGAA AGATTCAGCG TTTCCTTTCC 1150 

CAGCCATTCG CTGTCGCTGA GGTTTTCACC GGTATCCCAG GTAGATTGGT 1200 

TAGACTTCAG GACACCATCA AGTCCTTCAG AGAGGTTTTG GACGGTAAGT 1250 

ACGACCACTT GCCAGAG 12 67 



15 



30 



60 



2) INFORMATION FOR SEQ ID NO: 477 



( i ) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 12 96 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida viswanathii 

(B) STRAIN: ATCC 28269 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477 



CCAATTCGAC GAAGGTAGCT TGCCAGCTAT CTTGAACGCC TTGACCTTGA 50 

AGAACGGTGA CCAAGACTTG GTCTTGGAAG TTGCCCAACA CTTGGGTGAA 100 

3 5 AACACTGTCA GAGCTATTGC TATGGATGGT ACCGAAGGTT TGGTCAGAGG 150 

TACTGCTGTC ACTGACACCG GTGCTCCAAT CTCCGTCCCA GTCGGTAGAG 200 

GTACCTTGGG TAGAATCATC AACGTTGTTG GTGAACCAAT TGACGACAGA 250 

GGTCCAATTG AATGTAAGGA AAGAAAGCCA ATTCACGCTG AACCACCTTC 3 00 

TTTCGTTGAA CAATCCACTG CTGCCGAAAT TTTGGAAACC GGTATCAAGG 350 

40 TTGTCGACTT GTTGGCCCCA TACGCCAGAG GTGGTAAGAT TGGTTTGTTC 400 

GGTGGTGCCG GTGTCGGTAA GACTGTCTTT ATCCAAGAGT TGATTAACAA 450 

CATTGCCAAG GCCCATGGTG GTTTCTCTGT CTTCACTGGT GTCGGTGAAA 500 

GAACCAGAGA AGGTAACGAT TTGTACCGTG AAATGAAGGA AACCGGTGTC 550 

ATCAACTTGG AAGGTGACTC CAAGGTCGCC TTGGTTTTCG GTCAAATGAA 600 

45 CGAACCACCT GGTGCTAGAG CTAGAGTTGC TTTGACTGGT TTGACCATTG 650 

CCGAATACTT CAGAGACGAA GAAGGTCAAG ATGTCTTGTT GTTTATTGAC 700 

AACATTTTCA GATTCACCCA AGCCGGTTCC GAAGTGTCTG CTTTGTTGGG 750 

TCGTATTCCA TCTGCCGTCG GTTATCAACC AACCTTGGCC ACCGATATGG 800 

GTCTTTTGCA AGAACGTATT ACCACCACCA AGAAGGGTTC CGTCACCTCT 850 

50 GTCCAAGCTG TCTATGTCCC AGCCGATGAT TTGACCGATC CTGCTCCAGC 900 

CACCACCTTC GCTCACTTGG ATGCCACTAC TGTCTTGTCT AGAGGTATTT 950 

CCGAATTGGG TATCTACCCA GCTGTCGATC CATTGGACTC CAAGTCCAGA 1000 

TTGTTGGATG CTGCTGTTGT TGGTCAAGAA CATTACGATG TTGCCACTGG 1050 

TGTCCAACAA ACCTTGCAAG CTTACAAGTC TTTGCAAGAT ATCATTGCCA 1100 

55 TTTTGGGTAT GGATGAATTG TCTGAATCCG ATAAGTTGAC TGTCGAAAGA 1150 

GCTCGTAAGA TCCAAAGATT CTTGTCGCAA CCATTTGCTG TTGCCGAAGT 12 00 

TTTCACTGGT ATCCCAGGTA GATTAGTTAG ATTGCTGGAA ACCGTCCAAT 1250 

CTTTCAAAGA CGTCTTGGCT GGTAAATACG ATCACTTGCC AGAAAA 12 96 
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2) INFORMATION FOR SEQ ID NO: 478 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 95 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida zeylanoides 

(B) STRAIN: ATCC 7351 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 478 

TCAATTCGAG CAAGGCAACC TCCCTGCCAT CTTGAACGCT CTCACCTTGA 50 

AGAATGGTGA CAACGACTTG GTTTTGGAAG TTGCCCAGCA CTTGGGTGAG 100 

AACACCGTCA GAGCCATTGC CATGGATGGT ACCGAGGGTT TGGTTAGAGG 150 

TGCGTCCGTC AAGGACACTG GCGCCCCTAT CTCGGTCCCC GTTGGCCGCG 200 

GGACTTTGGG TCGTATCATC AACGTCACCG GTGACCCCAT TGACGAGAGA 250 

GGTCCCATCG AGCTGACCCA GAGAAACCCT ATCCACGCCG ACCCCCCCTC 3 00 

GTTCGTTGAG CAGTCCACCA ACGCTGAGGT TTTGGAGACT GGTATCAAGG 350 

TTGTCGATTT GTTGGCTCCC TACGCCAGAG GTGGTAAGAT TGGTTTGTTC 400 

GGTGGTGCCG GTGTCGGTAA GACCGTCTTC ATTCAGGAGT TGATCAACAA 450 

CATCGCCAAG GCCCACGGTG GGTTCTCGGT CTTCACTGGT GTCGGTGAGA 500 

GAACTAGAGA GGGTAACGAC TTGTACCGTG AGATGAAGGA GACCGGTGTC 550 

ATCAACTTGG AGGGTGACTC CAAGGTGGCC TTGGTGTTCG GTCAGATGAA 600 

CGAGCCCCCT GGAGCCAGAG CCAGAGTCGC CTTGACCGGG TTGACCATTG 650 

CCGAATACTT TAGAGACGAG GAGGGTCAGG ATGTGTTGTT GTTCGTCGAC 700 

AACATCTTCA GATTCACCCA AGCTGGTTCG GAGGTGTCGG CCTTGTTGGG 750 

TCGTATTCCC TCTGCCGTCG GTTACCAGCC CAC CTTGGCA ACTGATATGG 800 

GATTGTTGCA GGAGCGTATC ACCACGACCA AGAAGGGTTC CGTCACCTCA 850 

GTGCAGGCCG TCTACGTCCC CGCTGATGAC TTGACTGACC CTGCTCCCGC 900 

CACCACCTTT GCCCACTTGG ACGCCACCAC CGTGTTGTCC AGAGGTATCT 950 

CTGAGTTGGG TATCTACCCC GCCGTCGACC CCTTGGACTC CAAATCGAGA 1000 

TTGTTGGACG CTGCCGTGGT CGGTCAGGAG CACTACGATG TTGCCTCGAA 1050 

CGTCCAGCAG ACCTTGCAGG CCTACAAGTC CTTGCAGGAT ATCATTGCCA 1100 

TTTTGGGTAT GGATGAGTTG TCCGAGGCTG ACAAGTTGAC CGTTGAGAGA 1150 

GCCAGAAAGA TCCAGAGATT CTTGTCGCAG CCCTTCGCTG TTGCCGAGGT 1200 

TTTCACTGGT ATCAAGGGTA GATTGGTCAG ATTGGAGGAC ACCGTCAGAT 1250 

CTTTCAAGGA GGTTTTGGAG GGTAAGTACG ACCACTTGCC CGAGA 1295 



2) INFORMATION FOR SEQ ID NO: 479 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 534 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coccidioides immitis 

(B) ' STRAIN: Silveira 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 479 



PCT/CA00/01150 



TTCAGGAATT GATTGTAAGT TCTGTTATCA ACTAAAGCCG ACAGCGGTTG 50 

CTGATATGCT CTAGAACAAC ATTGCTAAGG CTCACGGTGG TTACTCCGTG 100 

5 TTCACTGGTG TCGGTGAGCG TACCCGTGAG GGTAACGATT TGTACCATGA 150 

AATGCAAGAG ACCCGTGTCA TTCAACTCGA CGGAGAGTCC AAGGTCGCTC 200 

TTGTCTTCGG TCAAATGAAC GAGCCCCCTG GTGCCCGTGC CCGTGTTGCC 250 

CTTACCGGTT TGACCATTGC TGAATACTTC CGTGACGAGG AAGGCCAAGA 300 

CGGTAGGCTT CATGCTTCTA TCGCTAGGGG CGTGTGATAC AGGAGGCTAA 350 

10 TCGCTTTTCT AGTGCTTCTC TTTATTGACA ACATTTTCCG TTTCACTCAA 400 

GCTGGTTCTG AAGTGTCTGC CTTGCTCGGT CGTATTCCTT CCGCTGTCGG 450 

TTACCAACCT ACTCTCGCCG TCGACATGGG TGTTATGCAG GAACGTATCA 500 

CCACCACCAC CAAGGGATCC ATTACTTCAG TGCA 534 



15 



30 



45 



55 



2) INFORMATION FOR SEQ ID NO: 480 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 494 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus albidus 

(B) STRAIN: ATCC 66030 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 480 



CGTCTTGATT CAAGAATTGA TCAACAACAT CGCCAAGGCC CACGGTGGTT 50 

ACTCGGTCTT CACCGGTGTC GGTGAGCGAA CTCGAGAGGG TAACGATCTG 100 

35 TACCACGAAG TGAGTTGCGC CGTCCGAGTG TTTCCCGGGG AATCGCAAGA 150 

CTGATGTTGT CCCTTCTTCT CAGATGAGGG AAACCGGTGT CATCAACCTC 2 00 

GAGGGTGACT CCAAGGTCGC CTTGGTCTTC GGTCAGATGA ACGAGCCCCC 250 

TGGAGCCCGA GCCCGAGTCG CCTTGACCGG TTTGACCATT GCCGAATACT 3 00 

TCCGAGACGA GGAGGGTCAG GATGTCTTGT TGTTCATTGA CAACATTTTC 3 50 

40 CGATTCACCC AAGCCGGTTC CGAAGTGTCC GCCTTGTTGG GTCGTATCCC 400 

CTCCGCCGTC GGTTACCAGC CCACTCTGTC CACCGACATG GGTACCATGC 450 

AGGAGCGAAT TACCACCACC AAGAAGGGTT CCATCACTTC CGTC 4 94 



2) INFORMATION FOR SEQ ID NO: 481 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 415 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusari urn oxysporum 

(B) STRAIN: WSA-212 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 481 
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TTCAGGAGCT TATCAACAAC 
TTCACTGGTG TCGGTGAGCG 
AATGCAGGAG ACTTCCGTCA 
TGGTCTTCGG TCAGATGAAC 
CTTACCGGTC TTACTGTAGC 
CGTGCTGCTC TTCATTGACA 
AGGTGTCTGC CCTTCTCGGT 
ACCCTCGCCG TCGACATGGG 
CAAGGGTTCC ATTAC 



ATCGCCAAGG 
AACTCGTGAG 
TTCAGCTTGA 
GAGCCCCCTG 
TGAATACTTC 
ACATTTTCCG 
CGTATCCCCT 
TGGTATGCAA 



CCCACGGTGG TTACTCCGTC 
GGTAACGATC TGTACCACGA 
TGGCGAGTCC AAGGTCGCCC 
GAGCTCGTGC CCGTGTCGCC 
AGAGATGAGG AGGGTCAGGA 
ATTCACTCAG GCCGGTTCCG 
CTGCCGTCGG TTACCAGCCC 
GAGCGTATTA CCACCACCAC 



100 
150 
200 
250 
300 
350 
400 
415 



2) INFORMATION FOR SEQ ID NO: 482 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1281 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Geotrichum spp. 

(B) STRAIN: Lev -4 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 482 

AGGACAACCT CCCCGCTATT CTTAACGCTC TTGAGCTTAA GAGAGATAAC 50 

GGTGAGAAGC TCGTTCTCGA GGTTGCCCAG CATCTGGGTG AGAACACTGT 100 

CCGTACTATT GCTATGGACG GTACTGAGGG TCTCGTCCGT GGTCAGCCCG 150 

TTGTTGACAC CGGTGCCCCC ATTAC CATTC CCGTTGGTCG TGGTACTCTT 200 

GGTAGAATTA TCAACGTCAT TGGTGAGCCC ATCGATGAGC GTGGACCCAT 250 

TGAGGCTACC AAGTACCTCC CCATCCACAC CGAGCCCCCC ACCTTCGCTG 300 

AGCAGTCTAC CTCCGCTGAG GTTCTTGAGA CTGGTATCAA GGTTGTCGAT 350 

CTCCTTGCCC CCTACGCCCG TGGTGGTAAG ATTGGTCTCT TCGGTGGTGC 400 

CGGTGTCGGT AAGACCGTTT TCATTCAGGA GCTGATTAAC AACATTGCCA 450 

AGGCCCATGG TGGTTTCTCC GTTTTCACCG GTGTCGGTGA GAGAACCCGT 500 

GAGGGTAACG ATTTGTACCG TGAGATGAAG GAGACCGGTG TCATCAACCT 550 

CGAGGGTGAG TCTAAGGTCG CTCTCGTTTT CGGTCAGATG AACGAGCCCC 600 

CTGGAGCCCG TGCCCGTGTT GCTCTTACTG GTCTTACCAT TGCTGAGTAC 650 

TTCCGTGATG AGGAGGGTCA GGATGTGTTG CTCTTCGTTG ACAACATTTT 700 

CCGTTTCACT CAGGCCGGTT CCGAGGTGTC TGCCCTTTTG GGTCGTATTC 750 

CCTCCGCTGT CGGTTACCAG CCCACCCTTG CCACTGATAT GGGTGCCCTG 800 

CAAGAGCGTA TTACCACCAC CCAGAAGGGT TCCGTCACTT CCGTCCAGGC 850 

CGTCTACGTC CCTGCCGATG ATTTGACCGA TCCTGCCCCT GCTACCACCT 900 

TCGCCCATTT GGATGCCACC ACCGTCTTGT CGCGTTCCAT TTCTGAGTTG 950 

GGTATCTACC CCGCTGTCGA TCCCCTTGAT TCCAAGTCTC GTCTTTTGGA 1000 

TATCACCGTC GTTGGCCAGG AGCACTACGA TGTTGCTACC CAGGTCCAGC 1050 

AGACCCTCCA GTCCTACAAG TCTCTTCAGG ATATCATTGC CATTTTGGGT 1100 

ATGGATGAGT TGTCTGAGGC TGATAAGCTT ACTGTCGAGC GTGCCCGTAA 1150 

GATCCAGAGA TTCCTTTCCC AGCCCTTCAC TGTCGCTGAG GTTTTCACTG 1200 

GTATCGAGGG CCGTCTCGTT CCTTTGAAGG ACACTGTTCG CTCTTTCAAG 1250 

GAGATCCTTG AGGGCAAGTA CGACCACCTC C 1281 



2) INFORMATION FOR SEQ ID NO: 483 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 586 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Histoplasma capsulatum 

(B) STRAIN: G185A5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 483 

15 ATTCAAGAAT TGAT CGTACG TTCCTCCGCC CCACACACGA TCAATGGAGA 50 

AAGAAACAAA TTTTTTGGGA GTGGTCATTT TTTCTAATAA TTCGAATAGA 100 

ACAACATTGC CAAAGCCCAC GGTGGTTACT CCGTGTTCAC TGGTGTCGGC 150 

GAGCGGACCC GTGAAGGAAA TGACTTGTAC CACGAAATGC AGGAAACCCG 200 

TGTTATCCAG CTCGATGGAG AGTCCAAGGT CGCCCTCGTT TTCGGTCAGA 250 

20 TGAACGAGCC TCCCGGAGCC CGTGCCCGTG TTGCCCTCAC TGGTCTGACC 300 

GTTGCCGAGT ACTTCCGTGA CGAGGAAGGC CAAGACGGTA TGTATAAGTA 350 

TACACCGTAG CAAATCAACA CAGAGCTTCA CTCACGCTCG GATTTAGTGC 400 

TTCTCTTCAT CGACAACATT TTCCGCTTCA CTCAGGCCGG TTCCGAAGTG 450 

TCTGCCCTGC TCGGCCGTAT TCCCTCCGCC GTCGGTTACC AACCCACCCT 500 

25 CGCCGTGGAC ATGGGTGGTA TGCAGGAACG TATCACCACT ACCACCAAGG 550 

GCTCCATCAC CTCTGTGCAR GCCGTCTACG TCCCCG 586 



30 2) INFORMATION FOR SEQ ID NO: '484 

(i) SEQUENCE CHARACTERISTICS : 

(A) : ; LENGTH : 1145 bases ' 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malassezia furfur 

(B) STRAIN: ATCC 42132 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 484 



45 



GCGTGGCCAG AAGGTCATTG ACACTGGTGC TCCCATCACC ATCCCCGTCG 50 

GTGGTGCCAC ACTGGGTCGT ATTCTGAACG TCACGGGTGA CCCTATTGAC 100 

GAGCGTGGCC CCGTTAAGAC TGACGTTTTC CGCCCCATTC ACCGTGACCC 150 

CCCTGCCTTT GTCGAGCAGT CGACTGATGC CGAGATTCTC GAGACTGGTA 200 

50 TCAAAGTCGT TGACCTGATT GCCCCTTACG CCCGTGGTGG TAAGATTGGT 250 

CTGTTCGGTG GTGCCGGTGT CGGTAAGACC GTGCTTATCC AGGAGCTCAT 300 

CAACAACATC GCCAAGGCCC ACGGTGGTTT CTCCGTGTTC ACTGGTGTCG 350 

GTGAGCGTAC TCGTGAGGGT AACGATTTGT ACCACGAGAT GATTGAAACC 400 

GGTGTCATTA ACCTCGAGGG TGACTCGAAG GTGGCTCTGG TGTTCGGTCA 450 

55 GATGAACGAG CCCCCGGGTG CCCGTGCGCG TGTCGCTCTT ACTGGTCTGA 500 

CTGTGGCCGA GTACTTCCGT GACGACGAGG GCCAGGATGT GCTGCTGTTC 550 

ATTGACAACA TTTTCCGTTT CACCCAGGCC GGTTCGGAGA CTTCGGCTCT 600 

GCTGGGTCGT ATCCCTTCGT CGGTCGGTTA CCAGCCCACT TTGTCGACCG 650 

ATATGGGTGC CATGCAGGAG CGTATCACCA CCACCAAGAA GGGTTCGATT 700 

60 ACGTCGGTGC AGGCCGTCTA CGTGCCCGCC GATGATGTCA CTGACCCTGC 750 
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CCCTGCCACT ACCTTCGCCC ACCTTGACGC 
CGATCGCTGA GCTGGGTATC TACCCCGCTG 
TCGCGTATGC TTGACCCCGC TATTGTGGGT 
CTCTGGCGTG CAGAAGCTGC TCCAGGACTA 
5 TTGCCATTCT GGGTATGGAT GAGCTTTCTG 
GAGCGTGCCC GTAAGATGCA GCGTTTCATG 
CCAGGTCTTT ACTGGTATCG AAGGTCGTCT 
TCAAGGCCTG CAAGGAGATC CTGTCGGGCA 



PCT/CA00/01150 

TACCACTGTG CTTGACCGTT 800 

TTGACCCGTT GAACTCGAAC 850 

CAGGAGCACT ACGACGTGGC 900 

CAAGTCGCTT CAAGATATCA 950 

AGGAGGACAA GCTCACTGTC 1000 

TCGCAGCCTT TCGCTGTCGC 1050 

TGTTGCCCTG AAGGACACGA 1100 

AGCACGACAA CCTCC 1145 



10 



2) INFORMATION FOR SEQ ID NO: 485 

(i) SEQUENCE CHARACTERISTICS : 
15 (A) LENGTH: 1261 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malassezia pachydermatis 

(B) STRAIN: ATCC 42756 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 485 

TCCCGCCATC TTCAACGCCC TGGAGGTCCA GGACATGAAG AACGGTGGCC 50 

GCCTTGTTCT GGAGGTTGCC CAGCACCTTG GTGAGAACAC TGTTCGTTGC 100 

30 ATTGCTATGG ACGGTACCGA GGGTCTTGTC CGTGGTCAGA AGGTCCTTGA 150 

CACTGGTGCC CCGATCACTA TCCCTGTCGG TAACGGTACC TTGGGCCGYA 200 

TCCTGAACGT CACTGGTGAG CCTGTGGATG AGCGTGGTCC GGTTAAGACT 250 

GACGTCTACC GTCCAATCCA CCGTGAGCCC CCGGCGTTCG TTGAGCAGTC 300 

GACTGATGCT GAGATTCTTG AGACTGGTAT CAAGGTGGTC GACCTGCTCG 350 

35 CCCCATACGC TCGTGGTGGT AAGATTGGTC TWTTCGGTGG TGCYGGTGTY 400 

GGTAAGACCG TGCTGATTCA GGAGCTTATC AACAACATTG CCAAGGCCCA 450 

CGGTGGTTTC TCGGTGTTCA CTGGTGTCGG TGAGCGTACT CGTGAGGGTA 500 

ACGATCTGTA CCATGAAATG ATTGAGACTG GTGTCATCAA CGTTGACGGT 550 

GACTCGAAGG TCGCTCTCGT GTTCGGTCAG ATGAACGAGC CCCCGGGTGC 600 

40 CCGTGCCCGT GTCGCCCTGA CYGGTCTGAC CATCGCCGAG TACTTCCGTG 650 

ACGACGAGGG TCAGGATGTG CTGCTCTTCA TTGACAACAT TTTCCGTTTC 700 

ACTCAGGCTG GTTCGGAGAC TTCGGCTCTG CTGGGTCGTA TCCCGTCGGC 750 

TGTCGGTTAC CAGCCKACCC TTGCCACGGA TATGGGTGCC ATGCAGGARC 800 

GTATYACCAC CACCAAGAAG GGTTCGATTA CCTCGGTGCA GGCYGTTTAC 850 

45 GTTCCGGCCG ACGATGTGAC TGACCCTGCC CCGGCCACGA CCTTCGCCCA 900 

CTTGGACGCC ACCACGGTGT TGGACCGTTC GATTGCYGAG CTGGGTATCT 950 

ACCCGGCCGT CGACCCGCTG AACTCGAAGT CGCGTATGCT TGACCCGTCA 1000 

ATTGTCGGTG TGGAGCACTA CAACGTTGCT TCGGGTGTCC AGAAGCTTCT 1050 

YCAGGACTAC AAGTCGCTCC AAGATATCAT TGCCATTCTG GGTATGGATG 1100 

50 AGTTGTCGGA GGAGGACAAG CTCACTGTCG AGCGTGCCCG TAAGATGCAG 1150 

CGTTTCCTGT CGCAGCCTTT CGCTGTGGCC CAGGTCTTCA CTGGTATCGA 1200 

GGGTCGTCTT GTGTCGCTCA AGGACACCAT CAACGCCTGC AAGGAGATTC 1250 

TGTCCGGTAA G 1261 



55 



2) INFORMATION FOR SEQ ID NO: 486 

(i) SEQUENCE CHARACTERISTICS : 
60 (A) LENGTH: 12 82 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Metschnikowia pulcherrijna 

(B) STRAIN: DSM 70336 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 486 



AGGAGGGCAA CTTGCCAGCT ATCTTCAACG 
GACCAGAAGT TGGTCTTGGA GGTGGCCCAG 
CAGAACCATT GCCATGGACG GTACCGAGGG 
TCACCGACAC YGGTGCCCCT ATCTCCGTGC 
GGTCGTATTA TCAATGTTGT TGGTGAGCCA 
CAACACCAAG AAGAGAAACC CTATTCACAC 
AGCAATCCAC TTCCGCCGAG GTCTTGGAGA 
TTGTTGGCCC CTTACGCCAG AGGTGGTAAG 
CGGTGTCGGT AAGACCGTGT TCATCCAGGA 
AGGCCCACGG TGGTTTCTCC GTTTTCACCG 
GAGGGTAACG ATTTGTACCG TGAAATGCAG 
CGAGGGTGAC TCCAAGGTCG CCTTGGTTTT 
CAGGAGCTAG AGCYAGAGTT GCCTTGACCG 
TTCAGAGACG AGGAGGGTCA GGATGTGTTG 
CAGATTCACC CAAGCCGGTT CTGAGGTGTC 
CATCCGCTGT CGGATACCAG CCAACCTTGG 
CAGGAGAGAA TTACCACCAC CAAGAAGGGT 
CGTCTACGTG CCAGCCGATG ACTTGACTGA 
TCGCCCACTT GGACGCCACC ACTGTGTTGT 
GGTATCTACC CCGCTGTCGA CCCCTTGGAC 
CGCCACCGTT GTTGGCCAGG AGCACTACGA 
AAACTTTACA AGCTTACAAG TCCTTGCAGG 
ATGGATGAGT TGTCCGAGAC CCGACAAGTT 
AAGATCCAGA AGTTCTTGTC CCAGCCATTT 
CGGTATTGAG GGTAGATTGG TTAGATTGGA 
AGGAGGTTTT GGAAGGTAAG TACGACCACT 



CTTTGACGTT GAAGAACGGC 50 

CACTTGGGTG AGAACACCGT 100 

TTTGGTCAGA GGCGCCTCTG 150 

CTGTCGGCCG TGAGACCTTG 200 

ATCGACGAGA GAGGCCCAAT 250 

CGACCCACCT TCGTTTGTCC 300 

CTGGTATCAA GGTTGTCGAC 350 

ATTGGTTTGT TCGGTGGTGC 400 

GTTGATTAAC AACATTGCCA 450 

GTGTCGGTGA GAGAACCAGA 500 

GAGACTGGTG TCATCAACTT 550 

CGGTCAGATG AACGAGCCTC 60 0 

GTTTGACCAT CGCCGAGTAC 650 

TTGTTCGTCG ACAACATTTT 700 

TGCCTTGTTG GGTCGTATTC 750 

CCACCGATAT GGGTGCCTTG 800 

TCCGTCACCT CCGTCCAGGC 850 

CCCTGCGCCA GCCACCACTT 900 

CCAGAGGTAT CTCTGAGTTG 950 

TCCAACTCCA GATTGTTGGA 1000 

CGTCGCCACC AACGTCCAGC *-. ' 1050 

ATATCATTGC CATTTTGGGT 1100 

GACCCGTCGA GAGAGCCAGA 1150 

GCCGTCGCCG AGGTTTTCAC 1200 

GGACACCGTT AGATCCTTTA 1250 

TG 1282 



2) INFORMATION FOR SEQ ID NO : 4 87 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 482 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium marneffei 

(B) STRAIN: WSA-214 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 487 

TGTCTTTATC CAGGAGTTGA TTGTACGTCT TTACCTTTCT GCCTGACTGT 50 

TTACGACAAC TAACGAAAGC GTAGAACAAC ATTGCCAAGG CTCACGGTGG 100 

TTACTCTGTC TTCACTGGTG TCGGTGAACG TACTCGTGAG GGTAACGATT 150 
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TGTACCACGA AATGCAGGAA 
AAGGTCGCCC TCGTGTTCGG 
CCGTGTCGCT CTTACTGGTT 
AAGGTCAGGA CGTGCTTCTC 
GCCGGTTCTG AGGTGTCTGC 
TTACCAGCCC ACCCTTGCCG 
CCACCACCAC CAAGGGTTCC 



ACTGGTGTCA 
TCAGATGAAC 
TGACCATTGC 
TTCATTGACA 
CCTTCTGGGT 
TCGACATGGG 
ATCACCTCCG 



TTCAGCTCGA GGGTGAATCC 
GAGCCCCCCG GTGCCCGTGC 
CGAGTACTTC CGTGACGAGG 
ACATTTTCCG TTTCACTCAG 
CGTATCCCCT CTGCCGTCGG 
TATCATGCAG GAGCGTATTA 
TC 



200 
250 
300 
350 
400 
450 
482 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 488 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1290 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) ' TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 18205 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 488 

TCGAACAAGG TAACTTACCA GCTATCTTGA ATGCTTTAGA AATCAAAACC 50 

CCAAGTGGTA CTCCATTAGT TTTAGAAGTT GCTCAACATT TAGGTGAAAA 100 

CACTGTCAGA ACTATTGCTA TGGATGGTAC TGAAGGTTTA GTTCGTGGTG 150 

AACAAGTTAC TGATACTGGT TCTCCAATCA CTGTCCCAGT TGGTCGTGAA 200 

ACTTTAGGTC GTATTATCAA CGTTGTTGGT GAACCAATTG ATGAACGTGG 250 

TCCAATTAAC ACCAAACAAA GAAACCCAAT TCACGCTGAA CCACCTTCAT 300 

TCAGTGAACA ATCAACTGCT GCTGAAGTTT TAGAAACTGG TATCAAAGTT 350 

GTYGATTTAT TAGCTCCATA CGCTAGAGGT GGTAAAATTG GTTTATTCGG 400 

TGGTGCCGGT GTCGGTAAAA CTGTCTTTAT CCAAGAATTG ATTAACAACA 450 

TTGCTAAAGC TCATGGTGGT TTCTCAGTTT TCACCGGTGT TGGTGAAAGA 500 

AC C AG AGAAG GTAACGATTT ATACCGTGAA ATGAAAGAAA CTGGTGTTAT 550 

TAACTTGGAA GGTGATTCTA AGGTCGCTTT AGTTTTCGGT CAAATGAATG 600 

AACCACCAGG TGCTAGAGCT CGTGTTGCTT TAACTGGTTT GACCATTGCT 650 

GAATACTTCA GAGATGAAGA AGGTCAAGAT GTCTTGTTAT TCGTTGATAA 700 

CATTTTCAGA TTCACCCAAG CCGGTTCAGA AGTTTCTGCC TTATTAGGTC 750 

GTATTCCATC TGCTGTCGGT TATCAACCAA CTTTAGCAAC TGATATGGGT 800 

TTGTTACAAG AACGTATTAC CACCACACAA AAAGGTTCAG TTACTTCTGT 850 

CCAAGCTGTT TATGTCCCAG CTGATGATTT AACAGATCCT GCTCCAGCTA 900 

CCACTTTCGC CCATTTGGAT GCTACTACTG TCTTGTCTCG TGGTATTTCA 950 

GAATTAGGTA TTTACCCAGC TGTCGATCCA TTAGATTCTA AATCAAGATT 1000 

ATTAGATGCT TCAGTTGTTG GTCAAGAACA TTATGATGTT GCTACCAACG 1050 

TTCAACAAAC TTTACAAGCT TACAAATCTT TACAAGATAT TATTGCTATT 1100 

TTAGGTATGG ATGAATTGTC TGAACAAGAT AAATTGACTG TCGAAAGAGC 1150 

AAGAAAAATC CAAAGATTCT TATCTCAACC ATTTGCTGTT GCCGAAGTTT 1200 

TCACTGGTAT YCCAGGTAGA TTGGTTAGAT TAAAAGACAC TATCAAATCA 1250 

TTCAAAGATG TTTTGGAAGG TAAATATGAT CACTTACCAG 1290 



2} INFORMATION FOR SEQ ID NO: 489 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 91 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 2149 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 489 

CCAATTCGAA CAAGGTAACT TACCAGCTAT CTTGAATGCT TTAGAAATCA 50 

AAACCCCAAG TGGTACTCCA TTAGTTTTAG AAGTTGCTCA ACATTTAGGT 100 

GAAAACACTG TCAGAACTAT TGCTATGGAT GGTACTGAAG GTTTAGTCCG 150 

15 TGGTGAACAA GTTACTGATA CTGGTTCTCC AATCACTGTC CCAGTTGGTC 200 

GTGAAACTTT AGGTCGTATT ATCAACGTTG TTGGTGAACC AATTGATGAA 250 

CGTGGTCCAA TTAACACCAA ACAAAGAAAC CCAATTCACG CTGAACCACC 300 

TTCATTCAGT GAACAATCAA CTGCTGCTGA AGTTTTAGAA ACTGGAATCA 350 

AAGTTGTTGA TTTATTAGCT CCATACGCTA GAGGTGGTAA AATTGGTTTA 400 

20 TTCGGTGGTG CCGGTGTCGG TAAAACTGTC TTTATCCAAG AATTGATTAA 450 

CAACATTGCT AAAGCTCATG GTGGTTTCTC AGTTTTCACC GGTGTTGGTG 500 

AAAGAACCAG AGAAGGTAAC GATTTATACC GTGAAATGAA AGAAACTGGT 550 

GTTATTAACT TGGAAGGTGA TTCTAAGGTC GCTTTAGTTT TCGGTCAAAT 600 

GAATGAACCA CCAGGTGCTA GAGCTCGTGT TGCTTTAACT GGTTTGACCA 650 

25 TTGCTGAATA CTTCAGAGAT GAAGAAGGTC AAGATGTCTT GTTATTCGTT 700 

GATAACATTT TCAGATTCAC CCAAGCCGGT TCAGAAGTTT CTGCCTTATT 750 

AGGTCGTATT CCATCTGCTG TCGGTTATCA ACCAACTTTA GCAACTGATA 800 

TGGGTTTGTT ACAAGAACGT ATTACCACCA CACAAAAAGG TTCAGTTACT 850 

TCTGTCCAAG CTGTTTATGT CCCAGCTGAT GATTTAACAG ATCCTGCTCC 900 

30 AGCTACCACT TTCGCCCATT TGGATGCTAC TACTGTCTTG TCTCGTGGTA 950 

TTTCAGAATT AGGTATTTAC CCAGCTGTCG ATCCATTAGA TTCTAAATCA 1000 

AGATTATTAG ATGCTTCAGT TGTTGGTCAA GAACATTATG ATGTTGCTAC 1050 

CAACGTTCAA CAAACTTTAC AAGCTTACAA ATCTTTACAA GATATTATTG 1100 

CTATTTTAGG TATGGATGAA TTGTCTGAAC AAGATAAATT GACTGTCGAA 1150 

35 AGAGCAAGAA AAATCCAAAG ATTCTTATCT CAACCATTTG CTGTTGCCGA 12 00 

AGTTTTCACT GGTATCCCAG GTAGATTGGT TAGATTAAAA GACACTATCA 1250 

AATCATTCAA AGATGTTTTG GAAGGTAAAT ATGATCACTT A 12 91 



40 

2) INFORMATION FOR SEQ ID NO: 490 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 508 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rhodotorula minuta 

(B) STRAIN: ATCC 10658 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 490 

CGTATTGATT CGTGAGTGGC CGTTCCCTTA CAGCAAGCTT ATAAAGGAGC 50 

GAAAAAGATC TGACATTCGG CTTATGTGCT ATACAGAGGA ACTCATCAAC 100 

AACGTCGCCA AGGCTCATGG TGGTTACTCT GTCTTCACCG GTGTCGGAGA 150 

60 GCGAACACGT GAAGGTAACG ATCTCTACCA CGAAATGATT GAAACCGGTG 200 
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TCATTCAGCT CAAGAACGAC 
ATGAACGAGC CCCCCGGAGC 
CATCGCCGAG TACTTCCGTG 
TCGACAACAT TTTCCGATTC 
CTCGGACGTA TCCCATCTGC 
TATGGGTGGT ATGCAAGAGC 
CCTCCGTC 



AAGTCCAAGG CCGCTCTGGT CTTCGGACAG 
TCGTGCCCGT GTCGCTCTGA CCGGTCTCAC 
ACGTCGAAGG ACAGGATGTG CTACTCTTCA 
ACCCAGGCCG GTTCAGAGGT ATCTGCCCTG 
TGTCGGATAC CAGCCCACAC TCTCAACCGA 
GAATCACAAC CACCAAGAAG GGTTCGATTA 



25t) 
300 
350 
400 
450 
500 
508 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 491 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 686 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rhodotorula mucilaginosa 

(B) STRAIN: ATCC 66034 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 491 

TGTCCTCATT CAGGAGCTCA TCAACAACAT CGCCAAGGCC CACGGTGGTT 50 

ACTCGGTCTT CACCGGTGTC GGCGAGCGTA CCCGTGAGGG TAACGGTGAG 100 

TCTCCCCCTT CAAACTTTTG GCCGGCTAGT TGGCGCAGCG CAAACTGACG 150 

CGCGCGCCCT GTCCAGACTT GTACCACGAG ATGATCGAGA CTGGTGTCAT 200 

CCAGCTCGAG AACGACAACT CGAAGTGCGC TCTCGTGTTC GGCCAGATGA 250 

ACGAGCCCCC TGGTGCCCGT GCCCGTGTCG CTCTCACTGG GTTCGTCCTT 300 

TCTCTCTCTC GAGCGTCCTG GCTTGATACG GAACGCTGAC ACGTCACGCA 350 

GTCTCACTAT TGCTGAGTAC TTCCGTGACG AGGAGGGCCA GGACGTGCTC 400 

CTCTTCATCG ACAACATCTT CCGTTTCACC CAGGGTGAGC CGCCTCCGCG 450 

GGCATTCTCC CGTTTCTTTC GCGCTGACGT CTGTCCCGTA TAGCCGGTTC 500 

GGAGGTGTCT GCCCTTCTCG GACGTATCCC GTCCGCTGTC GGATACCAGC 550 

CGACTCTCTC GACCGACATG GGTCAGATGC AGGAGCGTAT CACGTAAGTT 600 

TGGCCGCAGC TCCGTCCGCG GCGCCCTTTG TGTCTGACCG TGTTCCACCG 650 

CTCAGCACCA CCAAGAAGGG CTCGATCACC TGTGTC 686 



2) INFORMATION FOR SEQ ID NO: 492 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 625 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporobolomyces salmonicolor 

(B) STRAIN: ATCC 32311 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 492 
TGTCTTGATT CAGGAGCTCA TCAACAACGT CGCCAAGGCG CACGGTGGTT 50 
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ACTCCGTTTT 
CACACTTTCG 
ACCTCGAACA 
TCGACAACGA 
CCCCCTGGCG 
GTACTTCCGT 
TCTTCCGTTT 
TCGTGGTGAC 
CCCTTCTCGG 
ACCGACATGG 
TCTCTCGTTT 
CAAGAAGGGT 
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CACCGGTGTC 
CCGACCTGGC 
GATCTCTACC 
CAAGTCGAAG 
CCCGTGCCCG 
GACGACGAGG 
CACCCAGGGT 
TGACAACCTG 
TCGTATCCCG 
GTGGCATGCA 
CGCTCTGCAT 
TCGATCACCT 



GGTGAGCGTA 
CATTCCGGAA 
ACGAGATGAT 
ACTGCTCTCG 
TGTCGCTCTT 
GCCAGGACGT 
ACGTTCGATC 
TTGCGCGTGC 
TCCGCTGTCG 
GGAGCGTATC 
CGCTCACGCA 
GTGTC 



CCCGTGAGGG TAACGGTGAG 
CGTGCGGACT GACGAGGACC 
CGAGACCGGT GTCATTCAGC 
TCTTCGGCCA GATGAACGAG 
ACTGGTCTCA CCATCGCGGA 
GCTTCTCTTC ATCGACAACA 
CCGCCCGTCC AACACGAATG 
AGCCGGTTCG GAGGTGTCTG 
GATACCAGCC CACTCTCTCG 
ACGTACGCCC TCTTCTGCTT 
TGTTCGCCCG ACAGCACCAC 
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2) INFORMATION FOR SEQ ID NO: 493 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1211 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporothrix schenckii 

(B) STRAIN: WSA-148 

' (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 493 

*> 

TCTCGTCCGT GGTGCTAAGG CCACTGACAC TGGTTCCCCC ATTACCATCC 50 

CCGTCGGCCC CGGTACCCTC GGTCGCATCA TGAACGTCAC CGGTGACCCG 100 

ATCGACGAGC GCGGTCCCAT CAAGACCGAC AAGTTCCGTC CCATCCACGC 150 

TGAGGCTCCC GAGTTCGTTG AC CAGTCGAC CACCGCTGAG GTTCTCGTGA 2 00 

CTGGTATCAA GGTCGTCGAT CTGCTTGCTC CCTACGCCCG TGGTGGTAAG 250 

ATTGGTCTGT TTGGCGGTGC CGGTGTTGGC AAGACCGTGT TCATCCAGGA 3 00 

GCTCATCAAC AACATCGCCA AGGCCCACGG TGGTTACTCC GTCTTCACCG 350 

GTGTCGGCGA GCGTACCCGT GAGGGTAACG ATCTGTACCA CGAAATGCAG 400 

GAGACCTCTG TCATTCAGCT TGACGGTGAC TCCAAGGTCG CCCTGGTGTT 450 

CGGTCAGATG AACGAGCCCC CTGGTGCTCG TGCCCGTGTC GCCTTGACCG 500 

GTTTGACTGT CGCTGAGTAC TTCCGTGACG AGGAGGGCCA GGATGGTATG 550 

TTTTGAATTA TTTCCTTGTC GTACAGTTCC AAATCGAAGA ATTACTAACT 600 

TGTCAGTGCT TCTCTTCATC GACAACATTT TCCGCTTCAC CCAGGCCGGT 650 

TCTGAGGTGT CTGCCCTTCT GGGTCGTATT CCCTCCGCTG TCGGTTACCA 700 

GCCCACGCTC GCCGTGGACA TGGGTCTGAT GCAGGAGCGT ATTACCACCA 750 

CCCGCAAGGG CTCAATTACC TCCGTCCAGG CCGTCTACGT GCCCGCTGAC 800 

GATCTGACGG ATCCCGCCCC CGCCACCACC TTCGCCCATC TGGACGCCAC 850 

CACTGTGCTG TCCCGAGGTA TCTCTGAGCT GGGTATCTAC CCCGCTGTCG 900 

ACCCCCTCGA CTCCAAGTCG CGTATGCTGG ACCCCCGTAT TGTCGGTGAC 950 

GACCACTACG AGACCGCCAC TCGCGTCCAG CAGATCCTCC AAGAGTACAA 1000 

GTCGCTGCAG GACATCATCG CCATTCTGGG TATGGACGAG CTGTCTGAGG 1050 

CCGACAAGCT TACAGTCGAG CGTGCTCGTA AGATCCAGCG TTTCCTGAGC 1100 

CAGCCGTTCA CGGTCGCGCA GGTCTTCACT GGTATCGAAG GCCAGCTGGT 1150 

CGATCTGAAG GACACTATCG CTTCGTTCAA GGCTATCCTG AGCGGTGAGG 1200 

GTGACAGCCT T 1211 
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2) INFORMATION FOR SEQ ID NO: 4 94 



PCT/CAOO/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1133 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Stephanoascus ciferrii 

(B) STRAIN: ATCC 52550 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 494 

TCTTGTTAGA GGCACCCCAG TCAAGGACAC TGGTGCTCCA ATTACCATTC 50 

CAGTTGGTAA CGGCACTTTG GGCCGTATCG TCAACGTTCT CGGTGAGCCA 100 

AT TGATGAGC GTGGACCAGT CAAGGCTGAC AAGTT CAGAC CTATTCACGC 150 

TGAGCCACCA ACCTTCGCTG ACCAGTCCAC CTCTGCCGAG GTTCTTGAGA 20 0 

CCGGTATTAA GGTTGTCGAC TTGCTTGCCC CTTATGCCAG AGGTGGTAAG 250 

ATTGGTCTTT TCGGTGGTGC CGGTGTCGGT AAGACTGTGT TCATTCAGGA 300 

GCTTATTAAC AACATTGCTA AGGCCCACGG TGGTTACTCT GTCTTCACTG 350 

GTGTCGGTGA GCGAACTCGT GAAGGTAACG ATTTGTACCA CGAAATGATG 4 00 

GAGACCGGTG TCATCAACCT TGAGGGTGAC TCCAAGGTGT CTCTTGTGTT 450 

CGGTCAGATG AACGAGCCTC CAGGAGCCCG TGCCCGTGTT GCCTTGACCG 500 

GTTTGACCAT TGCCGAGTAC TTCAGAGATG AGGAGGGCCA GGATGTCTTG 550 

TTGTTCATTG ACAACATTTT CCGATTCACC CAGGCCGGTT CTGAGGTCTC 600 

TGCCTTGTTG GGTCGTATCC CATCTGCCGT CGGTTACCAA CCAACCTTGG 650 

CTACTGATAT GGGTGGTCTT CAAGAACGTA TTACCACCAC TCAAAAGGGT 700 

TCCGTCACCT CTGTCCAGGC TGTCTACGTC CCAGCTGACG ATTTGACTGA 750 

TCCTGCCCCA GCTACCACCT TCGCCCATTT GGACGCCACC ACCGAATTGT 800 

CCCGATCTAT -CTCTGAGTTG GGTATCTACC CAGCTGTCGA CCCTCTTGGT 850 

TCCAAGTCCC GTCTTTTGGA TGCCTCCGTC GTCGGCCAAG AGCACTACGA 900 

CGTTGCCGCC AACGTCCAAC AGACCTTGCA GGCCTACAAG TCTCTCCAGG 950 

ATATCATTGC CATTTTGGGT ATGGACGAAT TGTCTGAGGC TGATAAGCTC 1000 

ACTGTCGAGC GTGCTCGTAA GATGCAGAGA TTCCTTTCTC AGCCATTCAC 1050 

CGTCGCTGAG GTCTTCACTG GTCTCGAGGG TAGACTCGTT TCTTTGAAGG 1100 

ACACCATCCG ATCCTTCAAG GAGATCCTTG ACG 1133 



2) INFORMATION FOR SEQ ID NO: 495 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 608 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: WSA-225 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 495 

GGAGTTGATT GTAAGTCATT TGAAACCCAG CCCCAAGAAA CAGAAGCTAG 50 
GCGAAAATTG GACAATTGAG CAATTTAGCC ATTGGAGAAA AGAAATTTCG 100 
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AGTATTAATT GTTTTTATAG AACAACATTG 
TCTGTCTTCA CTGGTGTCGG AGAGCGTACC 
CCATGAAATG CAGGAGACCC GTGTCATTCA 
TCGCCCTGGT CTTCGGCCAG ATGAACGAGC 
5 GTTGCTCTTA CTGGTTTGAC CATTGCTGAG 
TCAAGACGGT GAGTTTCTTA TGGATAAAAA 
TTTTTTTCAA GAAATTCATG TTCTAACAAA 
TCATCGACAA CATTTTCCGT TTCACTCAGG 
CTGCTTGGTC GTATCCCATC TGCCGTCGGT 
10 CGACATGGGT GGTATGCAGG AACGTATTAC 
TTACCTCC 



PCT/CA00/01150 

CCAAGGCTCA CGGTGGTTHC 1*59 

CGTGAAGGAA ACGATCTCTA 200 

GCTTGATGGC GAGTCCAAGG 250 

CCCCAGGTGC CCGTGCCCGT 300 

TACTTCCGTG ATGAGGAAGG 350 

AAAATTTTTT TTTTTTTTTT 400 

GTGTATTCTA GTGCTTCTCT 450 

CTGGTTCCGA AGTGTCTGCC 500 

TACCAACCCA CTCTTGCCGT 550 

CACCACCAAG AAGGGATCCA 600 

608 



15 2) INFORMATION FOR SEQ ID NO: 4 96 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 794 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Vfangiella dermatitidis 

(B) STRAIN: WSA-229 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 496 



GTTTATTCAA GAACTCATTG TGGGTGGCAT TCTCATAATG TTTCGGCCAC 50 

AATTACTGAT TGAAAATAGA ACAACATTGC AAAGGCTCAT GGTGGTTACT 100 

CCGTGTTCAC TGGTGTCGGC GAGCGAACTC GTGAGGGTAA CGACTTGTAC 150 

• CACGAAATGC AGGAGACCTC TGTCATTCAG CTCGATGGCG AGTCCAAGGT 200 

35 CGCGCTGGTG TTTGGTCAAA TGAACGAACC TCCTGGTGCT CGTGCTCGTG 250 

TTGCTCTGAC TGGGTAAGTT GTTCCTTCGC TTCTTGCGCG TATCCACATC 300 

CCCATCTTGA GAATACGTCT GCCACCATGT CATGTGATGT TGGGCTGGTT 350 

CTGGTTTTTG GGAGGCCCTC AAGTTCAATT TTTGGATGAC AGCACCAGCT 400 

TTACAAGATT ATGCTAACTT AATGGAGTCT TACGGTGGCT GAGTTCTTCA 450 

40 GGGATGAGGA GGGACAGGAT GGTAAGTTTG ATAACAATCT CGTCGGTGTC 500 

AATATCGACG GCGTACTCTT CGCATCAAAA AACCAAAGAG GTGGTTTGGT 550 

GTGAGAAGTG CGCCGGAAAT AATGGCAACC ACGTGACAAT GACCACGTGT 600 

GGGGCTCCCG TGCTAACACG TGACAGTCTT GCTCTTCATC GACAACATTT 650 

TCCGATTCAC TCAASCCGGT TCTGARGTGT CTGCCTTGCT TGGTCGTATT 700 

45 CCATCTGCCG TCGGTTACCA ACCCACACTC GCCGTCGACA TGGGTCTCAT 750 

GCAGGAACGT ATCACCACCA CCCGGAAGGG ATCCATCACA TCTG 794 



50 2) INFORMATION FOR SEQ ID NO: 497 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1148 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Yarrowia lipolytica 

(B) STRAIN: ATCC 38295 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 497 

5 

TCTTGTCCGA GGCACCGCCG TCGCTGACAC CGGTGCTCCC ATCACTATCC 50 

CCGTCGGCCG AGGTACCCTT GGTCGAATCA TCAACGTCTG TGGTGAGCCC 100 

ATTGACGAGC GAGGACCCAT CGAGGCTTCC AAGTACCTCC CCATCCACGC 150 

TGACCCCCCT ACCTTCGCTG AGCAGTCTAC CTCCGCTGAG GTTCTCGAGA 2 00 

10 CCGGTATTAA GGTCGTCGAC CTCCTCGCCC CTTACGCCCG AGGTGGTAAG 250 

ATTGGTCTCT TCGGTGGTGC CGGTGTCGGT AAGACTGTCT TCATCCAGGA 300 

GCTGATTAAC AACATTGCCA AGGCCCATGG TGGTTTCTCC GTTTTCTGCG 350 

GTGTCGGTGA GCGAACCCGA GAGGGTAACG ATCTTTACCG AGAGATGAAG 4 00 

GAGACTGGTG TCATCAACCT CGAGGGTGAG TCTAAGGTCA CCCTCGTCTT 450 

15 CGGTCAGATG AACGAGCCTC CCGGAGCCCG TGCCCGAGTC GCCCTTACTG 500 

GTCTGACCAT TGCCGAGTAC TTCCGAGACG AGGAGGGTCA GGATGTGTTG 550 

CTCTTCGTTG ACAACATTTT CCGATTCACC CAGGCCGGTT CCGAGGTGTC 600 

CGCTCTGCTT GGTCGAATTC CCTCCGCTGT CGGTTAC CAG CCCACTCTGG 650 

CCACCGATAT GGGTGCCCTC CAGGAGCGAA TTACCACCAC CCAGAAGGGT 700 

20 TCCGTCACTT CCGTCCAGGC CGTCTACGTG CCTGCCGATG ATTTGACCGA 750 

TCCTGCTCCC GCCACCACCT TCGCCCATCT TGACGCCACC ACCGTCCTGT 800 

CCCGAGGTAT TTCCGAGCTG GGTATCTACC CCGCTGTCGA TCCCCTTGAT 850 

TCCAAGTCTC GACTTCTGGA TATCGATGTT GTCGGAAAGG AGCACTACGA 900 

TGTTGCTTCC AACGTCCAGC AGACCCTCCA GGCTTACAAG TCTCTCCAGG 950 

25 ATATCATTGC CATTCTTGGT ATGGATGAGC TGTCCGAGCA GGACAAGCTG 1000 

ACCGTCGAGC GAGCTCGAAA GATCCAGCGA TTCCTGTCTC AGCCCTTCAC 1050 

CGTCGCCGAG GTTTTCACCG GTATTGAGGG ACGACTTGTC TCTCTCAAGG 1100 

ACACTGTCCG ATCCTTCAAG GAGATCCTTG ACGGTAAGCA CGATGCTC 1148 

30 

2): INFORMATION FOR SEQ. ID NO*: 498 

-\ • - 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 966 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

* 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: VISA- 172 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 498 



GCGCTATTGT CGTTGTTGCT GCCTCCGACG GTCAGATGTA GGTGGAACAT 50 

CTTGGGAAAT ACGTCGTAAA ACACGTCGCT TACGTTTTCG CGAATAGGCC 100 

50 CCAGACTCGT GAGCATTTGC TGCTCGCCCG CCAGGTTGGT GTCCAGAAGA 150 

TCGTTGTCTT CGTCAACAAA ATCGATGCTA TTGATGATCC GGAGATGCTG 200 

GAACTGGTCG AACTCGAGAT GCGTGAGCTG CTGAACAGCT ACGGTTTCGA 250 

GGGTGAAGAG ACTCCGATCA TTTTCGGTTC CGCTCTCTGT GCTCTCGAAG 300 

GACGCCGTGA CGACATCGGT AAAGACAGAA TTGAGCAGCT TATGAACGCT 350 

55 GTCGACACCT GGATCCCCAC TCCTCAGCGT GACCTCGACA AACCTTTCTT 400 

GATGTCTGTC GAGGAAGTGT TCTCTATCGC CGGCCGTGGT ACCGTGGCTT 450 

CTGGTCGTGT CGAGCGTGGT ATCTTGAAGA AGGACTCTGA GGTTGAGATT 500 

GTTGGAGGCT CCTTCGAACC CAAGAAGACC AAAGTCACCG ACATTGAAAC 550 

CTTCAAGAAG AGCTGTGATG AATCGCGTGC TGGTGACAAC TCTGGTCTCC 600 

60 TCCTGCGTGG TATCCGACGT' GAAGACGTCA AGCGTGGTAT GGTCATTGCT 650 
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GTTCCCGGCA GCACCAAGGC 
CCTGACCGAG GCGGAGGGTG 
GTCCCCAAGT CTTCATCCGT 
AGATCTTCCG AGAGATTAGC 
TGCTGACCTC AGCTTCCCTG 
GTGACAACGT CGAGATGATC 
GCTGGTCAAC GCTTCA 



TCACGACAAG TTCCTCGTCT CCATGTACGT 
GTCGTCGTAC TGGCTTCGGT GCCAACTACC 
ACTGCAGGTA AGTTCCCGCA CACCGTGTCC 
GATATATGCT AATGATTCAT CAGACGAGGC 
ACGGCGACCA ATCTCGCAGA GTTATGCCTG 
CTGAAGACCC ACCACCCTGT TGCTGCTGAG 
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2) INFORMATION FOR SEQ ID NO: 499 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

4 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastoschizomyces capitatus 

(B) STRAIN: ATCC 10663 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 499 

TGGTGCTATT ATTGTTGTTG CTGCTTCTGA TGGTCAAATG CCCCAAACCC 50 

GTGAGCACTT GCTTCTTGCT CGTCAAGTTG GTGTTAAGCA CATTGTTGTT 100 

TTCGTTAACA AGATTGATAC TATTGATGAT CCTGAAATGT TGGAACTTGT 150 

CGAAATGGAA ATGAGAGAAC TTCTTTCTTC TTACGGTTTT "GATGGCGATA 200. 

ACACCCCTGT CATTATGGGT TCTGCTCTCT GTGCTCTTGA AGGTCGTGAA 250, 

CCAGAAATTG GTGAACAAAG AATCAACCAA CTCCTTGATG CTATCGATGA 3 00. 

ATACATTCCT ACCCCAGTTC GTGATATGGA CCAACCTTTC TTGATGCCAC 350 

TTGAAGGTGT TTTCTCTATT CCAGGTCGTG GTACTGTTGC CACTGGACGT 400 

GTCTATCGTG GTACTTTGAA GAGAGGTGAA GAAGTTGAAG TTGTTGGCTA 4 50 

CAATGATGCT CCAATCAAGA CCACCGTTAC TGGTATTGAA ATGTTCAAGA 500 

AGGAACTTGA TCAAGCTCAA GCTGGTGACA ACGCTGGTAT TCTTTTGAGA 550 

GGTGTTAAGC GTGAAGACCT TAAGCGTGGT ATGGTTGTTG CTAAACCAGG 600 

TACCGTTAAG CCACACACCA AGTTCCTTGC CTCCATCTAT GTTTTGACTA 650 

AGGAAGAAGG TGGCAGACAC TCTGGCTTTG GTCTTAACTA CAGACCTCAA 700 

CTTTTCCTTG GTTCTGCTGA TGTTACCACT GTCTTGACCT TCCCAGAGGG 750 

TGTTGACCAA AGCACTCAAG TCATGCCAGG TGACAACACT GAAATGGTTT 800 

GCGAACTTGT TCACCCAGTT GCTGTCGAAC AAGGCCAACG TTTCAA 84 6 



2) INFORMATION FOR SEQ ID NO: 500 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida rugosa 

(B) STRAIN: ATCC 96275 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 500 



PCT/CA00/01150 



GGTGCCATTA TTGTTGTTGC TGCCTCTGAT GGACAGATGC CCCAGACCCG 50 

TGAGCATCTT TTGCTTGCCC GCCAAGTCGG TATGCAAAAG GTCGTTGTGT 100 

5 TTGTTAACAA GATTGATACC ATTGATGACC CTGAAATGCT TGATCTTGTC 150 

GAGATGGAGA TGCGTGAACT GTTGAATGAA TATGACTTCG ATGGAGATAA 200 

CTCTCCTGTC ATTATGGGCT CTGCTCTTGC TGCTCTTGAG GACAAGAACC 250 

CCGAGATTGG TAAGGACCGT ATCATGCAGC TCTTGGACGC TGTTGATGAA 3 00 

TGGATCCCTA CCCCCGAGCG TGACCTTGAC AAGCCTTTCA TGATGCCTAT 350 

10 TGAGGCCTCT TTCTCCATTT CTGGTCGTGG TACTGTTGCC ACTGGCCGTG 400 

TCGAGCGTGG TATTCTCAAG AAGGGTGAGG AAGTCGAGAT CGTTGGTTTC 450 

AACAAGCAGC CCCTGAAATC TGTTGTTACT GGTATTGAAA TGTTCAAGAA 500 

GGAACTTGAT CAGGCCCAGG GCGGTGATAA TGCTGGTATC TTGCTTCGTG 550 

GTATTCGTCG TGAGGACTTG CAGCGTGGTA TGGTTTTGGC CAAGCCTGGA 600 

15 ACTGTTAAGG CTCACACCAA GTTCCTTTCC TCCATCTACG TTCTCTCCAA 650 

GGAAGAGGGC GGCCGTCACT CTCCTTTCGG TATGAACTAT CGTCCCCAGA 700 

TGTTCGTTTC TGCAGCTGAT GTCACCGTTA CTCTTACTTT CCCTGAGGGT 750 

GTTGAACAGC ACACTCAGGT CTTCCCTGGT GAGAACACCG AGATGGTTGG 800 

CGAGCTCGTT CACCCTACTG CTATTGAGGT TGGTCAACGC TTCAAC 846 

20 



2 ) INFORMATION FOR SEQ ID NO: 501 



25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 944 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



( ii ) MOLECULE. TYPE : Genomic DNA 



(vi ) ORIGINAL 'SOURCE : 

(A) ORGANISM: Coccidioides immitis 
35 (B) STRAIN: Silveira 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 501 

AGTTGTCGTC GTTGCTGCTT CAGACGGTCA AATGTATGCA ACCGAGAGCA 50 

40 CTCCCGGATC TTGGTTTAAA TGGCAC TAAT ATAAGACAGG CCTCAAACTC 100 

GAGAGCATTT ACTTCTCGCC CGTCAGATCG GTATCCAAAA AATCGTCGTC 150 

TTCGTGAACA AGGTTGATGC CATCGAGGAC AAAGAGATGT TGGAGCTTGT 200 

TGAATTGGAG ATGCGTGAAC TCCTAACCAG CTACGGTTTC GAGGGTGAAG 250 

AAACTCCCAT CATTTTTGGC TCTGCTCTCT GTGCCCTCGA AGGAAGACAA 300 

45 CCCGAGATCG GTGTTACCAA GATTGATGAG CTCTTGCAGG CCGTCGACAC 350 

CTGGATTCCC ACTCCTCAGC GTGAGACTGA CAAGCCCTTC TTGATGTCCA 400 

TTGAGGAAGT GTTCTCTATT TCCGGACGAG GAACCGTTGT CTCCGGCCGT 450 

GTGGAGCGTG GTATCCTCAA GAAGGACTCC GAAGTTGAAA TTGTCGGCGG 500 

TTCGCCCGAG CCAATCAAAA CCAAGGTTAC CGATATCGAA ACCTTTAAGA 550 

50 AGTCTTGCGA CGAGTCTCGC GCTGGTGATA ACTCCGGCTT GCTCCTACGA 600 

GGCGTTAAGC GTGAAGATAT TAGCCGTGGC ATGGTCGTCG CTGTACCAGG 650 

AAGTGTCAAG GCCCATACTG AATTCTTAGT TTCGCTTTAC GTCCTCACCG 700 

AAGCTGAGGG TGGGCGCAAA TCTGGATTCA GCAGCAAGTA CCGCCCACAG 750 

ATGTTCATTC GCACTGCCGG TATGTAATAC TGTGATAATT TCGTTGACAT 800 

55 GGTACTGATT GAATTCTATA GACGAAGCGG CTCAGCTCAG CTGGCCCGGA 850 

GAAGATCAAG ACAAGATGGC TATGCCAGGA GACAATATCG AAATGATTTG 900 

CACCACCTTG CACCCAGTTG CCGCCGAGGC TGGCCAGCGA TTCA 944 



60 
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2) INFORMATION FOR SEQ ID NO: 502 
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10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium oxysporum 

(B) STRAIN: WSA-212 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 502 

GCTATCATCG TCGTTGCTGC CTCCGATGGA CAGATGCCCC AGACCCGTGA 50 

ACACTTGCTC CTCGCTCGTC AGGTCGGTGT CCAGCGAATT GTCGTCTTCG 100 

TCAACAAGGT CGATGCCATT GATGACCCCG AGATGCTTGA GCTCGTCGAG 150 

20 ATGGAGATGC GCGAGCTTCT TAACACCTAC GGCTTCGAAG GCGACGACAC 200 

TCCCGTCATC ATGGGCTCTG CTCTTATGTC TCTCCAGAAC CAGCGCCCCG 250 

AGATTGGCAC CGAGAAGATC GATGAGCTCC TTGCTGCCGT CGACGAGTGG 300 

ATCCCAACCC CCGAGCGTGA CCTTGACAAG CCCTTCCTTA TGTCCGTCGA 350 

GGATGTCTTC TCCATTGCCG GCCGTGGTAC CGTCGTGTCT GGCCGTGTGG 400 

25 AGCGTGGTAT TCTGAAGCGT GATCAGGAGA TCGAGCTTGT CGGAAAGGGG 4 50 

CAGGAGGTTA TCAAGACCAA GGTTACCGAC ATTGAGACCT TCAAGAAGTC 500 

TTGTGAGCAG TCCCAGGCTG GTGACAACTC TGGTCTCCTC ATCCGAGGTG 550 

TTCGCCGTGA GGATGTCCGC CGTGGTATGG TTGTCTGCGC TCCTGGCACC 600 

GTCAAGTCTC ATACCCAGTT TCTCGCTTCC CTCTACGTCC TCACCAAGGA 650 

30 GGAGGGTGGC CGACACACCG GTTTCCAGGA GCACTACCGA CCCCAGCTCT 700 

ATCTCCGAAC TGCAGATGAG TCCATTGACC TGACTTTCCC CGAGGGTACT 750 

GAGGATGCCT CCAGTAAGAT GGTCATGCCT GGTGACAACA CCGAGATGGT -800 

TGTCACCATG GGTCACCCCA ATGCCATCGA GGTTGGTCAG CGATTCAAC 849 

■> 

35 

2) INFORMATION FOR SEQ ID NO: 503 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 1064 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Histoplasma capsulation 

(B) STRAIN: G186A5 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 503 



' TGGTGCTATT GTCGTTGTTG CTGCTGCTGA CGGCCAAATG TAAGACGCCG 50 

CGAGGGACTG CTGAGGGTTT TATGCTTTTT AGGCCCCCTT GTTTCTGAGA 100 

55 GCATGATGAT ACTAATATTC GGAAACGTAT CTATTAGGCC TCAAACACGT 150 

GAGCATTTGC TCCTTGCCCG ACAGGTCGGT GTCCAAAAGA TCGTCGTTTT 200 

CGTGAACAAA GTCGACGCCC TTGAGGACAA GGAGATGTTG GAGCTTGTCG 250 

AGTTAGAAAT GAGAGAGCTC TTAAACACCT ACGGCTTCGA GGGTGAAGAG 300 

ACACCCATCA TCTTTGGTTC TGCCCTTTGC GCCATGGAAG GCCGTGAGCC 350 

60 TGAGTTGGGA GAAAAGAAAA TTGATGAATT GCTGGAGGCT GTTGATACTT 400 
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GGATCCCCAC ACCACAACGT GATACCGAAA AACCTTTCTT GATGTCCGTT1 4'5© 

GAGGAAGTAT TCTCTATCTC CGGCCGTGGA ACCGTTGCCT CCGGTCGTGT 500 

TGAGCGCGGT GTCCTCAAGA AGGATTCAGA AGTCGAGCTA ATTGGGGGCG 550 

GCTCCACCCC CATCAGGACG AAGGTAACTG ATATCGAAAC TTTCAAGAAA 600 

5 TCCTGTGACG AGTCTAGAGC TGGGGACAAC TCCGGTCTTT TATTGCGTGG 650 

TATCAAGCGT GAAGATATCC GCCGTGGTAT GGTAGTTGCC GTTCCTGGCA 700 

GCGTCAAGGC CCACGACAAG TTCTTGGTGT CGATGTATGT CCTGACCGAA 750 

GCTGAGGGTG GTCGCCGAAC CGGATTCGGC CAGAACTATC GTCCTCAAAT 800 

GTTCATCCGC ACAGCTGGTA TGTCAAAATG GGACCCCTTT TCATAATCCT 850 

10 TTCTTTTTTT CCTTTTCCTC TCTATCTCTC TTTCTGTTTC CTTTCAACTC 900 

GCCTGATTCA CGAAATTAAC TAACCCGTTT GATTATAGAC GAAGCCGCCC 950 

ATCTCAGCTT CCCTAGTGGA GCAGATGAAA GCAAACTCGT TATGCCTGGT 1000 

GACAACGTCG AGATGATCCT CCAGACACAC CGCCCCGTGG CTGCTGAGGC 1050 

CGGCCAGCGA TTCA 1064 

15 



2) INFORMATION FOR SEQ ID NO: 504 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 982 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 
30 (B) STRAIN: ATCC 32071 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 504 

TGGTGCTATC GTCGTTGTTG CTGCCTCTGA CGGCCAAATG TAGGGATTTT 50 

35 GCAAGACTGG TGAAAAAATC TAAAGAAAAT AGAAAAGATT GTGCTGATGT 100 

TTGGTATCAG GCCCCAAACA CGCGAGCATC TGCTTCTTGC CAGACAAGTC 150 

GGTGTTCAGA AAATCGTTGT TTTCGTCAAC AAGGTCGATG CTGTAGAGGA 200 

TAAGGAGATG TTGGAGCTTG TCGAATTGGA GATGAGAGAG CTCTTGACCA 250 

CCTATGGGTT CGAGGGTGAG AAGACACCTA TCATCTTTGG TTCTGCGCTC 300 

40 TGTGCTATGG AGGGCCGTCA GCCCGAGTTG GGAGAGCAGA AAATTGATGA 350 

ATTACTCGAG GCTGTGGATA CTTGGATCCC TACGCCACAG CGTGATACTG 400 

ACAAGCCCTT CCTGATGTCC ATTGAGGAGG TGTTCTCTAT CTCTGGACGA 450 

GGAACCGTTG CCTCCGGCCG CGTTGAGCGT GGTATCCTCA AGAAGGACTC 500 

CGAAGTTGAA ATTATTGGCG GCGGTGTTCC CACAATCCTG ACCAAGGTGA 550 

45 CTGATATCGA AACCTTCAAG AAGTCTTGCG ACGAGTCCAG AGCCGGGGAC 600 

AACTCCGGCC TCTTGTTGCG CGGTGTCAAG CGTGAGGATA ■ TCCGCCGTGG 650 

TATGGTCGTT GCAGTTCCCG GAAGCGTCAA AGCACATGAC AGATTCTTGG 700 

TGTCGATGTA CGTTCTGACC GAGGCTGAGG GTGGTCGCCG CACTGGCTTC 750 

GGTCAGAACT ATCGTCCTCA AATGTTCATC CGCACAGCTG GTACGTTCAT 80 0 

50 TCTTTCACTA TATTCCTATA TGCATAGCCC GATCCTCCCA TTAACTAATT 850 

GACACAGACG AGGCTGCTGA ACTCAGCTGG CCTGATGGAG ACGACGAAGC 90 0 

CAAAATGGTC ATGCCCGGTG ACAATGTTGA AATGGTCCTG AAGTCACACC 950 

GCCCGGGGCC GCTGAGGCTG GACAGCGATT CA 982 



2 ) INFORMATION FOR SEQ ID NO: 505 



( i ) SEQUENCE CHARACTERISTICS : 
60 (A) LENGTH: 931 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium marneffei 

(B) STRAIN: ATCC 58950 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 505 



CGCTGTTGTT GTCGTCGCTG CTTCTGATGG TCAAATGTAA CATATCCACG 50 

AGCTGCCAAT TATGGACACT GCTGATAAGA ATAGGCCCCA AACCCGTGAG 100 

15 CACTTGCTCC TCGCCCGTCA GGTCGGTGTT CAAAAGATCG TCGTCTTCGT 150 

CAACAAGGTT GATGCCGTCG AGGACCCCGA GATGTTGGAA CTTGTCGAAT 200 

TGGAAATGCG TGAACTCTTG ACCACCTACG GTTTCGAGGG TGAAGAGACC 250 

CCTATCATTT TCGGATCCGC TCTTTGCGCC TTGGAAGGCC GCAAGCCCGA 300 

GATTGGCGAA CAGAAGATTG ACGAGCTCAT GAACGCCGTT GATACCTGGA 350 

20 TCCCCACCCC CCAGCGTGAC CTTGACAAGC CCTTCTTGAT GTCCGTTGAG 400 

GAAGTTTTCT CCATCTCTGG TCGTGGTACC GTTGCATCTG GTCGTGTTGA 450 

GCGTGGTATT TTGCGCAAGG ATTCTGAGGT TGAGATTATC GGATACCAGA 500 

AGAACCCTAT CAAGACCAAG GTTACCGACA TTGAGACCTT CAAGAAGTCT 550 

TGCGATGAAT CTCGTGCTGG TGACAACTCT GGCTTGCTTC TCCGTGGTAT 600 

25 CAAGCGTGAG GACATTCGTC GTGGTATGGT TATCGCTGCT CCTGGAACCA 650 

CCAAGGCTCA TGACAACTTC TTGGTCTCCA TGTATGTCTT GACTGAGGCT 700 

GAAGGTGGTC GTCGTACTGG ATTCGGCGCC AACTACCGTC CTCAAGCTTT 750 

CATCCGTACT GCCGGTATGT TCCCTTTCAA AGTCAATTAA TGAGCGATTT 800 

GCTAACGAGT TATAGATGAG GCTGCTACTC TCAGCTTCCC CGGTGACGAT 850 

30 CAGTCCAAGC AGGTCATGCC CGGTGACAAC GTTGAGATGA TCTTGAAGAC 900 

ACACCGTCCC GTTGCCGCCG AAGCTGGTCA G ' 931 



35 2) INFORMATION FOR SEQ ID NO: 506 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 182 05 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506 



50 



TGGTGCTATT ATTGTTGTTG CTGCTTCTGA TGGTCAAATG CCTCAAACCA 50 

GAGAACATTT RTTATTGGCT AGACAAGTTG GTGTTCAACA CATTGTTGTC 100 

TTTGTTAACA AAGTTGATAC TATTGATGAC CCAGAAATGT TGGAATTAGT 150 

TGAAATGGAA ATGAGAGAAT TGTTAAGTAC TTATGGTTTT GATGGTGATA 2 00 

55 ACGTCCCAGT TGTTATGGGT TCTGCTTTAT GTGCCTTGGA AGGTCGTGAA 250 

GAAGAAATTG GTGTCAAAGC TATTGATAAA TTATTAGCTG CTGTTGATGA 3 00 

ATATATCCCA ACCCCACAAA GAGATTTAGA AAAACCATTC TTGATGGGTG 350 

TTGAAGATGT CTTYTCAATC TCAGGTAGAG GTACCGTTGT TACTGGTCGT 400 

GTTGAACGTG GTAACTTGAA GAAAGGTGAT GAAGTTGAAA TTGTTGGTTT 450 

60 AAACAAAACT CCATTGAAAA CTACTGTYAC NGGTATTGAA ATGTTCAAAA 500 
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AAGAATTGGA CCAAGCTATG 
GGTATCAAAA GAGATGACAT 
TACCATCTCW GCTCACACTA 
AAGAAGAAGG TGGTCGTCAC 
TTATTCATCA GAACTGGTGA 
TGGTGATTCA TCTCAACAAA 
GTGAATTGGT TCACCCAACT 



GCTGGTGATA 
YAAAAGAGGT 
AATTCTTAGC 
TCAGGTTTTG 
TGTTACCGTT 
TCTTACCAGG 
GCTTTAGAAG 



ACTGTGGTAT CTTATTACGT 
ATGGTTATTG CTAAAACCGG 
CTCAATGTAT ATTTTGACTA 
GTGAACATTA CAGACCTCAA 
GTTTTAACCT TYCCAGAAGG 
TGACAATGTC GAAATGGTTT 
CTGGTCAAAG ATTCAA 



550 
600 
650 
700 
750 
800 
846 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 507 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 964 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

( i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: WSA-225 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 507 

GGAGCTGTTG TCGTTGTCGC AGCTTCTGAC GGTCAAATGT AATTGAATGC 50 

CCGCCCAGAC GGATGAAAGG "ATTTGACGTT TCTAACAT C A GTCTAGGCCT 100 

CAGACCAGAG AACATTTGCT CCTTGCCCGC CAGGTCGGTG TCCAGAAGCT 150 

GGTCGTTTTC GTTAACAAGG TCGATGCCGT TGAGGACCCA GAGATGTTGG 200' 

AGCTTGTCGA ACTTGAAATG CGTGAACTCC TCAGCCACTA CAGTTTTGAG 250 

GGTGAGGAGA CCCCCATCAT TTTTGGCTCT GCTCTCTGTG CCCTCGAGTC 300 

CCGTCGACCT GAGCTTGGTG TCGAGAAGAT TGACGAGCTA TTGAACGCCG 350 

TCGACACCTG GATCCCCACC CCCGAGCGCG CCACTGATAA GCCTTTCCTC 400 

ATGTCCATTG AGGAAGTGTT CTCTATCTCT GGTCGTGGTA CCGTCGTCTC 450 

CGGTCGTGTT GAGCGTGGTA TCCTCAAGAA GGATTCCGAC GTCGAAATTG 500 

TTGGTGGCTC TACCACCCCT ATCAAGACCA AGGTCACAGA TATCGAAACC 550 

TTCAAGAAGT CCTGCGATGA ATCTCGAGCT GGTGACAACT CTGGTCTCCT 600 

TCTCCGAGGT ATCAAGCGTG AGGACTTGAA GCGTGGAATG GTTGTTGCTG 650 

CCCCCGGATC CACCAAGGCT CACACCGACT TCATGGTCTC CCTCTACGTC 700 

CTGACTGAGG CTGAGGGTGG TCGTTCCAAC GGCTTCACCC ACAAGTACCG 750 

CCCTCAAATG TTCATCCGTA CTGCTGGTAT GTAACCAAAG TTTCCGCTAT 800 

TTACTAAGTA GATCATTGCT AACTTGTATT CCCTTCCGTA GACGAAGCCG 850 

CATCTTTCAG CTGGCCTGGA GAAGACCAAG ACAAGAAGGC TATGCCTGGT 900 

GACAACGTCG AGATGATTTG CAAAACCCTC CACCCCATTG CTGCCGAGGC 950 

TGGCCAACGA TTCA 964 



2) INFORMATION FOR SEQ ID NO: 508 

( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: .844 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Yarrowia lipolytica 

(B) STRAIN: ATCC 3 8295 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 508 

5 

GGTGCTATCA TTGTCGTTGC TGCTGGAGAC GGTTCCAAGC CCCAGACCCG 50 

AGAGCATCTG CTGCTTGCTC GACAGGTCGG TGTCCAGAAC CTGGTTGTGT 100 

TTGTTAACAA GGTTGATCAG ATTGATGATA AGGAGATTCT TGAGCTCGTT 150 

GACATGGAGA TGCGAGATCT GCTGACCCAG TACGGTTTTG ATGGTGACAA 200 

10 CACCCCCGTT GTCATGGGCT CTGCTCTGTG CGCTCTTGAG GGCAAGCAGA 250 

AGGATATTGG AGAGGACGCC ATCATGGCCC TTATGGATGC CGTTGATGAG 3 00 

CACATCCCTA CCCCTAACCG TGACCTTGAG AAGCCCTTCC TGATGCCCGT 350 

TGAGGACGTT TTCTCCATCT CTGGCCGAGG AACTGTTGTT ACTGGCCGAG 400 

TCGAGCGAGG AAACCTGAAG AAGGGTGAGG AAATCGAGAT TGTTGGCTAC 450 

15 AACAACAAGC CCATCAAGGC TGTTGTTACC GGTATTGAGA TGTTCAAGAA 500 

GGAGCTCGAG TCCGCCATGG CCGGTGACAA CGCCGGTATC CTGCTCCGAG 550 

GTATCAAGCG AGACGAGATC AAGCGAGGTA TGGTCATGTG CAAGCCTGGC 600 

ACCGTCAACG CCCACACCAA GTTCCTTGCT TCTCTTTACA TCATCCCCAC 650 

CGAGGAGGGT GGTCGAACCA GCTCTTTCGG CGCCAACTAC CGACCCCAGA 700 

20 TGTTCATCCG AACTTCTTCC GTCACCGCCA CTCTCACCTT CCCCGAGGGT 750 

ACCGACGAGT CCCAGACCGT CAACCCCGGT GACAACACTG AGATGGTTCT 800 

CGAGCTTGTT CACCCTACCG CCATTGAGGT CAACCAGCGA TTCA 844 



25 

2) INFORMATION FOR SEQ ID NO: 509 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1067 bases 
"30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double" 

(D) TOPOLOGY: Linear 

r „. 

t 

(ii) MOLECULE TYPE: Genomic DNA 

35 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bigemina 

(B) STRAIN: Suarez-2 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509 

CTTGGACAAG CTGAAGAGCG AGCGTGAGCG TGGTATCACC ATTGACATTA 50 

CCCTGTGGAA GTTCGAAACT GGCAAGTACT ACTACACCGT CATTGACGCC 100 

CCCGGTCACC GTGACTTCAT TAAGAACATG ATTACGGGTA CCTCCCAGGC 150 

45 CGATGTTGCT ATGCTTGTCG TGCCCGCCGA GGCTGGTGGT TTCGAAGCTG 200 

CCTTCTCTAA GGAAGGTCAG ACCCGTGAGC ACGCTCTTTT GGCCTTCACC 250 

CTTGGTGTCA AGCAGATCAT TTGCGCCATC AACAAGATGG ACAAGTGCGA 300 

CTACAAGGAG GACCGTTACA GCGAAATCCA GAAGGAAGTT CAGGGTTACC 350 

TGAAGAAGGT CGGTTACAAC ATCGAGAAGG TGCCTTTCGT CGCCATCTCC 400 

50 GGTTTCATGG GTGACAACAT GGTTGAGCGC TCCACCAACA TGCCGTGGTA 450 

CAAGGGCAAG ACCTTGGTCG AGGCCCTCGA CATGATGGAG CCCCCGAAGA 500 

GGCCCGTCGA CAAGCCCCTG CGTCTTCCCC TCCAGGGTGT GTACAAGATC 550 

GGTGGTATCG GTACCGTCCC TGTCGGTCGT GTGGAGACTG GTCAGCTCAA 600 

GGCCGGTATG GTCCTCACCT TCGCCCCCAA CCCGATCACT ACTGAGTGCA 650 

55 AATCCGTCGA AATGCACCAC GAAGTTATCG ATGTTGCCAG CCCTGGTGAC 700 

AACGTTGGTT TCAACGTGAA GAACGTGTCC ACCTCTGACA TCCGCACTGG 750 

TCACGTCGCT TCTGACTCCA AGAACGACCC CGCCAAGGCC GCCGTGTCCT 800 

TCACCGCCCA GGTCATCATC TTGAACCACC CTGGTACCAT CAAGGCCGGT 850 

TACTCCCCTG TGGTTGACTG CCACACTGCC CACATCTCGT GCAAATTCGA 900 

60 CGAGATCACC AGCCGTATGG ACAAGCGTAC CGGTAAGGCC CTTGAGGAGA 950 

313 
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ACCCCAAGAC CATCAAGAAC GGCGACGCCG 
TGCAAGCCCA TGGTCGTCGA GGCCTTCACT 
TTTCGCCGTG CGTGACG 
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CTATGGTCGT CCTGAAGCCG 1000 
GAATACGCTC CCCTTGGTCG 1050 

1067 



2) INFORMATION FOR SEQ ID NO: 510 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 104 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bovis 

(B) STRAIN: Suarez-3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 510 

GTGAACGTGA ACGTGGTATT ACTATTGATA TTACCTTATG GAAGTTCGAG 50 

ACCACCAAGT ACTACTACAC CGTCATTGAT GCCCCTGGTC ACCGTGACTT 100 

CATCAAGAAC ATGATTACGG GTACTTCTCA AGCCGATGTT GCTATGCTTG 150 

TTGTACCAGC TGAGGCTGGT GGTTTCGAGG CCGCTTTCTC CAAGGAAGGA 2 00 

CAGACCCGTG AGCACGCTCT TTTGGCTTTC ACCCTTGGTG TCAAACAGAT 250 

CATCTGTGCC ATTAACAAGA TGGACAAGTG CGACTACAAG GAGGACCGTT 300 

ACAGTGAAAT CCAGAAGGAA GTCCAGGGTT ACCTCAAGAA GGTCGGTTAC 350 
AATATTGAGA AGGTGCCCTT CGTTGCCATC . TCCGGTTTCA TGGGAGACAA * 400 

CATGGTTGAG CGTTCCACCA ACATGCCCTG . GT AT AAGGG A AAGACATTGG 450 

TCGAGGCCCT TGATCAGATG GAACCCCCAA AGAGGCCCGT TGACAAGCCA 500 

CTTCGTCTTC CCCTCCAGGG TGTCTACAAG ATCGGTGGTA' TCGGTACCGT 550 

CCCCGTCGGT CGTGTTGAAA CTGGTATGTT GAAGGCTGGT ATGATTCTAA 600 

CCTTTGCTCC TAACCCAATC ACCACTGAAT GCAAATCCGT TGAAATGCAC 650 

CACGAAACCG TTGAGGTTGC TTACCCCGGT GACAACGTCG GTTTCAACGT 700 
AAAGAACGTT TCTACTTCTG ACATTCGCAG TGGTCACGTT GCCTCTGATT 4 750 

CTAAGAACGA CCCTGCCAAG GCTGCTGTTT CCTTCACTGC CCAGGTCATT 800 

GTGCTCAACC ACCCTGGTAC CATTAAGGCC GGTTACTGCC CCGTCGTCGA 850 

TTGCCACACC GCTCACATTT CATGTAAATT CGAAGAGAT C ACCAGCCGTA 900 

TGGACAAGCG TACCGGTAAA TCTCTTGAGG AAAACCCCAA GACCATCAAG 950 

AACGGTGACG CTGCCATGGT TGTGCTCAAG CCAATGAAGC CCATGGTTGT 1000 

CGAATCCTTC ACTGAGTATG CTCCTCTTGG TCGTTTCGCT GTTCGTGAC 1049 



2) INFORMATION FOR SEQ ID NO: 511 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1070 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Crithidia fasciculata 

(B) STRAIN: ATCC 11745 

314 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 511 

* 

TGGACAAGCT GAAGGCGGAG CGCGAGCGCG GTATCACGAT CGATATTGCC 50~ 

CTGTGGAAGT TCGAGTCGCC CAAGTCCGTG TTCACGATCA TCGATGCCCC 100 

5 CGGCCACCGC GACTTCATCA AGAACATGAT CACCGGCACC TCCCAGGCCG 150 

ATGCCGCCAT TCTGATGATT GACTCGACCC AGGGTGGCTT CGAGGCTGGC 200 

ATCTCCAAGG ACGGCCAGAC CCGCGAGCAC GCCCTGCTTG CCTTCACGCT 250 

GGGCGTGAAG CAGATGGTTG TGTGCTGCAA CAAGATGGAC GACAAGACGG 300 

TGCAGTACGC CCAGGCCCGC TACGAGGAGA TCAGCAAGGA GGTCGGCGCG 350 

10 TACCTGAAGC GCGTGGGCTA CAACCCGGAG AAGGTGCGCT TCATCCCGAT 400 

CTCGGGCTGG CAGGGCGACA ACATGATCGA GAAGTCCGAC AACATGTCGT 450 

GGTACAAGGG TCCCACGCTG CTGGAGGCGC TCGACCTGCT GGAGGCCCCC 500 

GTGCGTCCGG TGGACAAGCC GCTGCGCCTG CCCCTGCAGG ACGTGTACAA 550 

GATCGGCGGT ATCGGCACTG TGCCCGTGGG CCGTGTGGAG ACCGGCGTGA 600 

15 TGAAGCCGGG CGACGTTGTG GTGTTCGCGC CTGCCAACGT GACGACCGAG 650 

GTGAAGTCGA TCGAGATGCA CCACGAGCAG CTGGCTGAGG CCGTGCCCGG 700 

CGACAACGTG GGCTTCAACG TGAAGAACGT GTCCGTGAAG GATATCCGCC 750 

GTGGTAACGT GTGCGGCAAC ACGAAGAGCG ACCCCCCGAA GGAGGCGGCC 800 

GACTTCACTG CCCAGGTGAT CGTGCTGAAC CACCCCGGCC AGATCAGCAA 850 

20 CGGCTACGCG CCGGTGCTGG ACTGCCACAC GAGCCACATC GCGTGCAAGT 900 

TCGCGGACAT CGAGTCCAAG ATCGACCGCC GCTCTGGCAA GGAGCTGGAG 950 

AAGAGC CCGA AGGCCATCAA GTCCGGCGAT GCGGCCATCG TGAAGATGAT 1000 

CCCGCAGAAG CCGATGTGCG TGGAGGTGTT CAACGACTAC CCGCCGCTGG 1050 

GCCGCTTCGC TGTCCGCGAT 1070 

25 



2 ) INFORMATION FOR SEQ ID NO: 512 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) . LENGTH: 1052 bases 

(B) : TYPE: Nucleic acid 

(C) STRANDEDNESS : Double - 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Entamoeba histolytica 
40 (B) STRAIN: HM1-IMMS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512 

GCTGAAAGAG AAAGAGGAAT TACTATTGAT ATTTCATTAT GGAAATT CGA 50 

45 AACATCTAAA TACTACTTCA CTATTATTGA TGCCCCAGGT CAYAGAGATT 100 

TCATTAAGAA CATGATTACT GGAACTTCAC AAGCTGATGT TGCCATCCTT 150 

ATTGTTGCTG CTGGTACTGG WGAATTTGAA GCTGGTATTT CAAAGAATGG 200 

ACAAACCAGA GAACAYATTC TTCTTTCATA CACTCTTGGA GTTAAACAAA 250 

TGATTGTTGG WGTTAACAAG ATGGATGCTA TTCAATATAA ACAAGAAAGA 300 

50 TATGAAGAAA TTAAGAAAGA AATTAGTGCA TTCCTTAAGA AGACWGGATA 350 

TAATCCAGAC AAGATTCCAT TTGTCCCAAT TTCAGGATTC CAAGGAGATA 400 

ATATGATTGA ACCATCAACC AACATGCCAT GGTACAAAGG ACCAACATTA 450 

ATTGGAGCAC TTGATTCAGT CACACCACCA GAAAGACCAG TTGATAAACC 500 

ACTTAGACTT CCACTTCAAG ATGTTTAYAA GATTTCAGGT ATTGGAACTG 550 

55 TACCATGTGG AAGAGTTGAA ACTGGAGTTC TTAGACCAGG AACTATTGTT 600 

CAATTTGCAC CATCAGGAGT TTCATCTGAA TGTAAATCAG TTGAAATGCA 650 

TCACACAGCA CTTGCTCAAG CTATTCCAGG TGATAATGTT GGATTCAATG 700 

TTAGAAAYTT AACAGTTAAA GATATTAAGA GAGGAAATGT AGCATCAGAT 750 

GCTAAGAATC AACCAGCTGT TGGATGTGAA GATTTCACTG CTCAAGTCAT 800 

60 TGTCATGAAC CATCCAGGAC AAATTAGAAA GGGATATACA CCAGTTCTTG 850 
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ATTGCCATAC ATCACACATT GCATGTAAAT 
ATTGATAGAA GAACAGGTAA ATCCATGGAA 
TAAGAATGGA GATTCAGCAC TTGTTAAGAT 
GTGTTGAAGA ATTTGCTAAA TTCCCACCAT 
5 GA 



PCT/CA00/01150 

TCGAAGAATT ATTA^GCAAG 90^ 
GGAGGAGAAC CAGAAtATAT 9^0 
TGTTCCAACT AAACCACTTT 1000 
TGGGAAGATT TGCTGTTAGA 1050 

1052 



2) INFORMATION FOR SEQ ID NO: 513 



10 



( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 1082 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Giardia lamblia 

(B) STRAIN: Faubert-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 513 

25 GACGAGCGCG AGCGCGGGAT CACGATCAAC ATCGCGCTCT GGAAGTTCGA 50 

GACGAAGAAG TACATCGTCA CGATCATCGA CGCCCCGGGC CACCGCGACT 100 

TCATCAAGAA CATGATCACG GGGACGTCCC AGGCCGACGT CGCGATCCTC 150 

GTCGTCGCGG CGGGCCAGGG CGAGTTCGAG GCCGGGATCT CGAAGGACGG 200 

CCAGACGCGC GAGCACGCGA CCCTTGCGAA CACGCTCGGG ATCAAGACGA 250 

30 TGATCATCTG CGTCAACAAG ATGGACGACG GCCAGGTCAA GTACTCGAAG 3 00 

GAGCGCTACG ACGAGATCAA , GGGCGAGATG ATGAAGCAGC TCAAGAACAT 3 50 

CGGCTGGAAG AAGGCCGAGG AGTTCGACTA CATCCCGACG TCCGGCTGGA 400 

CCGGGGACAA CATCATGGAG AAGTCCGACA AGATGCCCTG GTACGAGGGC 450 

CCGTGCCTGA TCGACGCGAT CGACGGGCTC AAGGCCCCGA AGCGCCCGAC 500 

3 5 CGACAAGCCC CTCCGCCTCC CGATCCAGGA CGTCTACAAG ATCTCGGGCG 550 

TCGGGACCGT CCCCGCGGGC CGCGTCGAGA CGGGCGAGCT CGCGCCCGGG 600 

ATGAAGGTCG TCTTCGCCCC GACGTCCCAG GTCTCGGAGG TCAAGTCCGT 650 

CGAGATGCAC CACGAGGAGC TCAAGAAGGC CGGGCCCGGG GACAACGTCG 700 

GCTTCAACGT CCGCGGGCTC GCCGTCAAGG ACCTCAAGAA GGGCTACGTC 750 

40 GTCGGGGACG TGACGAACGA CCCGCCCGTC GGCTGCAAGA GCTTCACCGC 800 

CCAGGTCATC GTCATGAACC ACCCGAAGAA GATCCAGCCC GGCTACACGC 850 

CCGTCATCGA CTGCCACACC GCGCACATCG CGTGCCAGTT CCAGCTCTTC 900 

CTCCAGAAGC TCGACAAGCG CACGCTCAAG CCCGAGATGG AGAACCCGCC 950 

CGACGCAGGC CGCGGCGATT GCATCATCGT CAAGATGGTC CCCCAGAAGC 1000 

45 CCCTGTGCTG CGAGACGTTC AACGACTACG CGCCCCTCGG CCGCTTCGCC 1050 

GTCCGCGACA TGCGCCAAAC CGTTGCCGTC GG 1082 



50 2) INFORMATION FOR SEQ ID NO: 514 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

316 
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(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 30816 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514 

5 

ACGCGTGGGT GCTCGACAAG CTGAAGGCGG AGCGCGAGCG CGGCATCACG 50 

ATCGACATTG CGCTGTGGAA GTTCGAGTCG CCCAAGTCCG TGTTCACGAT 100 

CATCGATGCG CCCGGCCACC GCGACTTCAT CAAGAACATG ATCACGGGCA 150 

CGTCGCAGGC GGACGCCGCC ATCCTGATGA TCGACTCGAC GCATGGTGGC 200 

10 TTCGAGGCTG GCATCTCGAA GGACGGCCAG ACCCGCGAGC ACGCGCTGCT 250 

TGCCTTCACT CTTGGCGTGA AGCAGATGGT GGTGTGCTGC AACAAGATGG 3 00 

ACGACAAGAC GGTGACGTAC GCGCAGTCGC GCTACGATGA GATCAGCAAG 350 

GAGGTGGGCG CGTACCTGAA GCGCGTGGGC TACAACCCGG AGAAGGTGCG 4 00 

CTTCATCCCG ATCTCGGGCT GGCAGGGCGA CAACATGATC GAGAAGTCGG 4 50 

15 ACAACATGCC GTGGTACAAG GGTCCCACGC TGCTGGACGC GCTCGACATG 500 

CTGGAGCCGC CGGTGCGCCC GGTGGACAAG CCGCTGCGCC TGCCCCTGCA 550 

GGACGTGTAC AAGATCGGCG GTATCGGGAC GGTGCCCGTG GGGCGCGTGG 600 

AGACCGGCAT CATGAAGCCG GGCGACGTGG TGACGTTCGC GCCCGCCAAC 650 

GTGACGACTG AGGTGAAGTC GATCGAGATG CACCACGAGC AGCTGGCGGA 700 

20 GGCGCAGCCC GGCGACAACG TCGGCTTCAA CGTGAAGAAC GTGTCGGTGA 750 

AGGACATCCG CCGTGGTAAC GTGTGCGGCA ACTCGAAGAA CGACCCGCCG 800 

AAGGAGGCGG CCGACTTCAC GGCGCAGGTG ATCGTGCTGA ACCACCCCGG 850 

CCAGATCAGC AACGGCTACG CGCCGGTGCT GGACTGCCAC ACGAGCCACA 900 

TTGCGTGCCG CTTCGCGGAA ATCGAGTCCA AGATCGACCG CCGCTCCGGC 950 

25 AAGGAGCTGG AGAAGAACCC CAAGGCGATC AAGTCTGGCG ATGCCGCGAT 1000 

CGTGAAGATG GTGCCGCAGA AGCCGATGTG CGTGGAGGTG TTCAACGACT 1050 

ACGCGCCGCT GGGCCGCTTT GCCGTGCGCG ACATGCGCCA AACCGTTG 1098 



30 

2 ) INFORMATION FOR SEQ ID NO: 515 

» *. 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1104 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania aethiopica 

(B) STRAIN: ATCC 50119 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 515 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

50 ACGTCGCAGG CGGACGCCGC CATCCTGATG ATCGACTCGA CGCATGGTGG 200 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

TTGCCTTCAC TCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 3 00 

GACGACAAGA CGGTGACGTA CGCGCAGTCG CGCTACGAGG AGATCAGCAA 350 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 4 00 

55 GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAAGTCG 450 

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

AGGACGTGTA CAAGATCGGC GGTATCGGGA CGGTGCCCGT GGGCCGCGTG 600 

GAGACCGGCA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

60 CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 
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AGGCGCAGCC CGGCGACAAC GTCGGCTTC& 
AAGGACATCC GCCGTGGCAA CGTGTGCGGC 
GAAGGAGGCG GCCGACTTCA CGGCGCAGGT 
GCCAGATCAG CAACGGCTAC GCGCCGGTGC 
5 ATTGCGTGCC GCTTCGCGGA AATCGAGTCC 
CAAGGAGCTG GAGAAGAACC CCAAGGCGAT 
TCGTGAAGAT GGTGCCGCAG AAGCCGATGT 
TACGCGCCGC TGGGCCGCTT TGCCGTGCGC 
CGTC 



PCT/CA00/0115O 

M-rcMSffir c^s^gstg - -re g 

AACTCGAAGA ACGACCCGCC 8 QQ 

GATCGTGCTG AACCACCCCG 850 

TGGACTGCCA CACGAGCCAC 900 

AAGATCGACC GCCGCTCCGG 950 

CAAGTCTGGC GATGCCGCGA 1000 

GCGTGGAGGT GTTCAACGAC 1050 

GACATGCGCC AAACCGTTGC 1100 

1104 



2) INFORMATION FOR SEQ ID NO: 516 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1106 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 
25 (B) STRAIN: ATCC 30815 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 516 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

ACGTCGCAGG CGGACGCCGC CATCCTGATG ATCGACTCGA CGCATGGTGG 200 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

TTGCCTTCAC KCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 3 00 

GACGACAAGA CGGTGACGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 350 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 400 

GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAAGTCG 450 

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

AGGACGTGTA CAAGATCGGC GGTATCGGGA CGGTGCCCGT GGGGCGCGTG 600 

GAGACCGGCA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 

AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750 

AAGGACATCC GCCGTGGTAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850 

GCCAGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900 

ATTGCGTGCC GCTTCGCGGA AATCGAGTCC AAGATCGACC GCCGCTCCGG 950 

CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GATGCCGCGA 1000 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050 

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTTGC 1100 

CGTCGG 1106 



30 
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50 



55 2) INFORMATION FOR SEQ ID NO: 517 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1099 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 

318 
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(D) TOPOLOGY: Linear 



PCT/CA00/01150 



(ii) MOLECULE TYPE: Genomic DNA 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania donovani 

(B) STRAIN: ATCC 50212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 517 

10 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

ACGTCGCAGG CGGACGCCGC CATCCTGATG ATCGACTCGA CGCATGGTGG 200 

15 CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 2 50 

TTGCCTTCAC GCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 300 

GACGACAAGA CCGTGACGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 350 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 400 

GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAGGTCG 450 

20 GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

AGGACGTGTA CAAGATCGGC GGTATCGGGA CTGTGCCCGT GGGCCGCGTG 600 

GAGACCGGCA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 

25 AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750 

AAGGACATCC GCCGTGGCAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850 

GCCAGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900 

ATTGCGTGCC GCTTCGCGGA AATCGAGTCC AAGATCGACC GCCGCTCCGG 950 

30 CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GATGCCGCGA 1000 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050 

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTTG 1099 



35 



2) INFORMATION FOR SEQ ID NO: 518 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1098 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania infantum 

(B) STRAIN: MOU 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

55 ACGTCGCAGG CGGACGCCGC CATCCTGATG ATCGACTCGA CGCATGGTGG 200 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

TTGCCTTCAC GCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 3 00 

GACGACAAGA CCGTGACGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 3 50 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 400 

60 GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAGGTCG 450 

319 
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GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCT££3ACAT SOjb 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

AGGACGTGTA CAAGATCGGC GGTATCGGGA CTGTGCCCGT GGGCCGCGTG 600 

GAGACCGGCA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

5 CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 

AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750 

AAGGACATCC GCCGTGGCAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850 

GCCAGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900 

10 ATTGCGTGCC GCTTCGCGGA AATCGAGTCC AAGATCGACC GCCGCTCCGG 950 

CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GATGCCGCGA 1000 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050 

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTT 1098 

15 

2) INFORMATION FOR SEQ ID NO: 519 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1071 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania enriettii 

(B) STRAIN: ATCC 5012 0 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 519 



CTCGACAAGC TGAAGGCGGA GCGCGAGCGC GGCATCACGA TCGACATTGC 50 

GCTGTGGAAG TTCGAGTCGC CCAAGTCTGT GTTCACGATC ATCGATGCGC 100 

35 CCGGCCACCG CGACTTCATC AAGAACATGA TCACCGGCAC GTCGCAGGCC 150 

GACGCCGCCA TCCTGATGAT CGACTCGACC CAGGGCGGCT TCGAGGCTGG 200 

CATCTCGAAG GACGGCCAGA CCCGCGAGCA CGCGCTGCTC GCCTTCACGC 250 

TTGGTGTGAA GCAGATGGTG GTGTGCTGCA ACAAGATGGA CGACAAGACG 300 

GTGCAGTACT CGCAGGCGCG CTACGAGGAG ATCAGCAAGG AGGTGGGCGC 350 

4 0 GTACCTGAAG CGCGTCGGCT ACAACCCGGA GAAGGTGCGC TTCATCCCGA 4 00 

TCTCGGGCTG GCAGGGCGAC AACATGATCG ACAAGTCGGA CAGCATGCCG 450 

TGGTACAAGG GGCCCACGCT GCTGGACGCG CTCGACATGC TGGAGGCGCC 500 

GGTGCGCCCG GTGGACAAGC CGCTGCGCCT GCCCCTGCAG GACGTGTACA 550 

AGATCGGCGG TATCGGGACG GTGCCCGTGG GCCGCGTGGA GACCGGCATC 600 

45 ATGAAGCCTG GCGACGTGGT GACGTTTGCG CCCGCCAACG TGACGACGGA 650 

GGTGAAGTCG ATCGAGATGC ACCACGAGCA GCTGGCGGAG GCGGTGCCCG 7 00 

GCGACAACGT CGGCTTCAAC GTGAAGAACG TGTCGGTGAA GGACATCCGC 750 

CGTGGTAACG TGTGCGGCAA CTCGAAGAAC GACCCGCCGA AGGAGGCGGC 800 

CGACTTCACG GCGCAGGTGA TCGTGCTGAA CCACCCTGGC CAGATCAGCA 850 

50 ACGGCTACGC GCCGGTGCTG GACTGCCACA CGAGCCACAT CGCGTGCCGC 900 

TTCGCGGATA TCGAGTCCAA GATCGACCGC CGCTCTGGCA AGGAGCTGGA 950 

GAAGAACCCC AAGGCGATCA AGTCCGGCGA TGCGGCCATC GTGAAGATGG 1000 

TGCCGCAGAA GCCGATGTGC GTGGAGGTGT TCAACGACTA CCCGCCGCTG 1050 

GGGCGCTTCG CTGTGCGCGA C 1071 

55 



2) INFORMATION FOR SEQ ID NO: 52 0 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1071 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania gerbilli 
10 (B) STRAIN: ATCC 50121 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 520 

CTCGACAAGC TGAAGGCGGA GCGCGAGCGC GGCATCACGA TCGACATTGC 50 

15 GCTGTGGAAG TTCGAGTCGC CCAAGTCCGT GTTCACGATC ATCGATGCGC 100 

CCGGCCACCG CGACTTCATC AAGAACATGA TCACGGGCAC GTCGCAGGCG 150 

GACGCCGCCA TCCTGATGAT CGACTCGACG CATGGTGGCT TCGAGGCTGG 2 00 

CATCTCGAAG GACGGCCAGA CCCGCGAGCA CGCGCTGCTT GCCTTCACTC 250 

TTGGCGTGAA GCAGATGGTG GTGTGCTGCA ACAAGATGGA CGACAAGACG 300 

20 GTGACGTACG CGCAGTCGCG CTACGATGAG ATCAGCAAGG AGGTGGGCGC 350 

GTACCTGAAG CGCGTGGGCT ACAACCCGGA GAAGGTGCGC TTCATCCCGA 400 

TCTCGGGCTG GCAGGGCGAC AACATGATCG AGAAGTCGGA CAACATGCCG 450 

TGGTACAAGG GTCCCACGCT GCTGGACGCG CTCGACATGC TGGAGCCGCC 500 

GGTGCGCCCG GTGGACAAGC CGCTGCGCCT GCCCCTGCAG GACGTGTACA 550 

25 AGATCGGCGG TATCGGGACG GTGCCCGTGG GCCGCGTGGA GACCGGCATC 600 

ATGAAGCCGG GCGACGTGGT GACGTTCGCG CCCGCCAACG TGACGACTGA 650 

GGTGAAGTCG ATCGAGATGC ACCACGAGCA GCTGGCGGAG GCGCAGCCCG 700 

GCGACAACGT CGGCTTCAAC GTGAAGAACG TGTCGGTGAA GGACATCCGC 750 

CGTGGTAACG TGTGCGGCAA CTCGAAGAAC GACCCGCCGA AGGAGGCGGC 800 

3 0 CGACTTCACG GCGCAGGTGA TCGTGCTGAA CCACCCCGGC CAGAT CAGC A 850 

ACGGCTATGC GCCGGTGCTG GACTGCCACA CGAGCCACAT TGCGTGCCGC 900 

TTCGCGGAAA TCGAGTCCAA GATCGACCGC CGCTCCGGCA AGGAGCTGGA 950 
GAAGAACCCC AAGGCGATCA AGTCTGGCGA TGCCGCGATC GTGAAGATGG * 1000 

TGCCGCAGAA GCCGATGTGC GTGGAGGTGT TCAACGACTA CGCGCCGCTG 1050 

35 GGCCGCTTTG CCGTGCGCGA C 1071 



2) INFORMATION FOR SEQ ID NO: 521 



40 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1114 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Leishmania hertigi 

(B) STRAIN: ATCC 50125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 521 

55 TCGTTCAAGT ACGCGTGGGT GCTCGACAAG CTGAAGGCGG AGCGCGAGCG 50 

CGGTATCACG ATCGACATTG CGCTGTGGAA GTTCGAGTCG CCCAAGTCGG 100 

TGTTCACGAT CATCGATGCG CCCGGCCACC GCGACTTCAT CAAGAACATG 150 

ATCACCGGCA CGTCGCAGGC GGATGCTGCC ATTCTGATGA TCGATTCGAC 2 00 

GCAGGGTGGC TTCGAGGCTG GCATCTCGAA GGACGGCCAG ACGCGCGAGC 250 

60 ACGCGCTGCT GGCCTTCACG CTGGGCGTGA AGCAGATGGT TGTGTGCTGC 300 
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AACAAGATGG ACGACAAGAC GGTGCAGTAC GCGCAGGCGC GCTACEAGGA 250/ 

GATCAGCAAG GAGGTGGGCG CGTACCTGAA GCGCGTGGGC TACAACCCGG 4(f0 

AGAAGGTGCG CTTCATCCCG ATCTCGGGCT GGCAGGGCGA CAACATGATC 450 

GAGAAGTCCG ACAACATGTC GTGGTACAAG GGTCCCACGC TGCTGGAGGC 500 

5 GCTGGACATG CTGGAGGCGC CGGTGCGCCC GGTGGACAAG CCGCTGCGCC 550 

TGCCCCTGCA GGACGTGTAC AAGATCGGCG GCATTGGCAC GGTGCCGGTG 600 

GGCCGTGTGG AGACCGGCAT CATGAAGCCC GGCGACGTGG TGACGTTCGC 650 

GCCCGCCAAC GTGACGACGG AGGTGAAGTC GATCGAGATG CACCACGAGC 700 

AGCTGCAGGA GGCTGTGCCC GGCGACAACG TCGGCTTCAA CGTGAAGAAC 750 

10 GTGTCGGTGA AGGACATCCG CCGTGGTAAC GTGTGTGGCA ACTCGAAGAA 800 

CGACCCGCCG AAGGAGGCGG CTGACTTCAC GGCGCAGGTG ATCGTGCTGA 850 

ACCACCCCGG CCAGATCAGC AACGGCTACG CGCCGGTGCT GGACTGCCAC 900 

ACCAGCCACA TCGCGTGCCG CTTCGCGGAC ATCGAGTCGA AGATCGACCG 950 

CCGCTCCGGC AAGGAGCTGG AGAAGAACCC CAAGTCCATC AAGTCCGGCG 1000 

15 ACGCCGCCAT CGTGAAGATG GTGCCGCAGA AGCCGATGTG CGTGGAGGTG 1050 

TTCAACGACT ACCCGCCGCT GGGCCGCTTT GCGGTGCGCG ACATGCGCCA 1100 

AACCGTTGCC GTCG 1114 



20 



2) INFORMATION FOR SEQ ID NO: 522 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1106 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 

■ (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania major 

(B) STRAIN: ATCC 50122 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

4 0 ACGTCGCAGG CGGACGCCGC CATTCTGATG ATCGACTCGA CGCATGGCGG 200 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

TTGCCTTCAC TCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 300 

GACGACAAGA CGGTGACGTA CGCGCAGTCR CGCTACGATG AGATCAGCAA 350 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 400 

45 GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAAGTCG 450 

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

AGGACGTGTA CAAGATCGGC GGTATCGGGA CGGTGCCCGT GGGCCGCGTG 600 

GAGACCGGCA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

50 CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 

AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750 

AAGGACATCC GCCGTGGTAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850 

GCCAGATCAG CAACGGCTAT GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900 

55 ATTGCGTGCC GCTTCGCGGA AATCGAGTCC AAGATCGACC GCCGCTCCGG 950 

CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GATGCCGCGA 1000 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050 

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTTGC 1100 

CGTCGG 1106 

60 
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2) INFORMATION FOR SEQ ID NO: 523 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1105 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

( i i ) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania amazonensis 
15 (B) STRAIN: ATCC 50131 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 523 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

2 0 GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCGAAGTCC GTGTTCACGA 100 

T CAT CGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

ACGTCGCAGG CGGACGCGGC CATCCTGATG ATCGACTCGA CGCATGGTGG 200 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

TTGCCTTCAC TCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 3 00 

25 GACGACAAGA CGGTGATGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 350 

GGAGGTGAGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 4 00 

GCTTCATCCC GATCTCGGGG TGGCAGGGCG ACAACATGAT CGACAAGTCG 450 

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

30 AGGACGTGTA CAAGATCGGC GGTATCGGGA CGGTGCCCGT GGGCCGCGTG 600 

GAGACCGGGA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 

AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750 

' AAGGACATCC GCCGTGGTAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC .800 

35 GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850 

GCCAGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900 

ATCGCGTGCC GCTTCGCGGA GATCGAGTCC AAGATCGACC GCCGCTCCGG 950 

CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GACGCCGCGA 1000 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050 

40 TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTTGC 1100 

CGTCG 1105 



45 2) INFORMATION FOR SEQ ID NO: 524 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania mexicana 

(B) STRAIN: ATCC 50156 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524 
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TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGC&TCAC 50 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCGAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

ACGTCGCAGG CGGACGCGGC CATCCTGATG ATCGACTCGA CGCATGGTGG 200 

5 CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

TTGCCTTCAC TCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 3 00 

GACGACAAGA CGGTGATGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 350 

GGAGGTGAGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 400 

GCTTCATCCC GATCTCGGGG TGGCAGGGCG ACAACATGAT CGACAAGTCG 450 

10 GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

AGGACGTGTA CAAGATCGGC GGTATCGGGA CGGTGCCCGT GGGCCGCGTG 600 

GAGACCGGGA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 

15 AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750 

AAGGACATCC GCCGTGGTAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850 

GCCAGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900 

ATCGCGTGCC GCTTCGCGGA GATCGAGTCC AAGATCGACC GCCGCTCCGG 950 

20 CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GACGCCGCGA 1000 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050 

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTT 1098 



25 

2) INFORMATION FOR SEQ ID NO: 525 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1081 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS.: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tarentolae 

(B) STRAIN: II WT 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 525 

GCTGAAGGCG GAGCGCGAGC GCGGCATCAC GATCGACATT GCGCTGTGGA 50 

AGTTCGAGTC GCCCAAGTCG GTGTTCACGA TCATCGATGC GCCCGGCCAC 100 

CGCGACTTCA TCAAGAACAT GATCACGGGC ACGTCGCAGG CGGACGCGGC 150 

45 CATCCTGATG ATCGACTCGA CGCACGGTGG GTTCGAGGCT GGCATCTCGA 200 

AGGACGGGCA GACGCGCGAG CACGCGCTGC TTGCCTTCAC TCTTGGCGTG 250 

AAGCAGATGG TTGTGTGCTG CAACAAGATG GACGACAAGA CGGTGATGTA 3 00 

CGCGCAGTCG CGCTACGATG AGATCAGCAA GGAGGTGGGC GCGTACCTGA 350 

AGCGCGTGGG CTACAACCCG GAGAAGGTGC GCTTCATCCC GATCTCGGGC 4 00 

50 TGGCAGGGCG ACAACATGAT CGAGAAGTCG GACAACATGC CGTGGTACAA 450 

GGGTCCCACG CTGCTGGACG CGCTCGACAT GCTGGAGGCG CCGGTGCGCC 500 

CGGTGGACAA GCCGCTGCGC CTGCCCCTGC AGGACGTGTA CAAGATCGGC 550 

GGCATCGGCA CGGTGCCCGT GGGCCGCGTG GAGACCGGCA TCATGAAGCC 600 

GGGCGACGTG GTGACGTTCG CGCCCGCGAA CGTGACGACG GAGGTGAAGT 650 

55 CGATCGAGAT GCACCACGAG CAGCTGGCGG AGGCGCAGCC CGGCGACAAC 700 

GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG AAGGACATCC GCCGTGGGAA 750 

CGTGTGCGGT AACTCGAAGA ACGACCCGCC GAAGGAGGCC GCCGACTTCA 800 

CGGCGCAGGT GATCGTGCTG AACCACCCCG GCCAGATCAG CAACGGCTAC 850 

GCGCCGGTGC TGGACTGCCA CACGAGCCAC ATCGCGTGCC GGTTCGCGGA 900 

60 CATCGAGTCC AAGATTGACC GCCGCTCCGG CAAGGAGCTG GAGAAGAACC 950 
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CCAAGGCGAT CAAGTCCGGC GATGCCGCGA TCGTGAAGAT GGTGdCGCAG 

AAGCCGATGT GCGTGGAGAT GTTCAACGAC TACGCGCCGC TTGGCCGCTT 

TGCTGTGCGC GACATGCGCC AAACCGTTGC C 1081 



2) INFORMATION FOR SEQ ID NO: 526 

(i) SEQUENCE CHARACTERISTICS : 
10 (A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 50129 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526 



l'ooof 

10^6 



AAATACGCGT GGGTGCTCGA CAAGCTGAAG GCGGAGCGCG AGCGCGGCAT 50 

CACGATCGAC ATTGCGCTGT GGAAGTTCGA GTCGCCCAAG TCCGTGTTCA 100 

25 CGATCATCGA TGCGCCCGGC CACCGCGACT TCATCAAGAA CATGATCACG 150 

GGCACGTCGC AGGCGGACGC CGCCATCCTG ATGATCGACT CGACGCATGG 200 

TGGCTTCGAG GCTGGCATCT CGAAGGACGG CCAGACCCGC GAGCACGCGC 250 

TGCTTGCCTT CACKCTTGGC GTGAAGCAGA TGGTGGTGTG CTGCAACAAG 300 

ATGGACGACA AGACGGTGAC GTACGCGCAG TCGCGCTACG ATGAGAT CAG 350 

30 CAAGGAGGTG GGCGCGTACC TGAAGCGCGT GGGCTACAAC CCGGAGAAGG 4 00 

TGCGCTTCAT CCCGATCTCG GGCTGGCAGG GCGACAACAT GATCGAGAAG 450 

TCGGACAACA TGCCGTGGTA CAAGGGTCCC ACGCTGCTGG ACGCGCTCGA 500 

CATGCTGGAG CCGCCGGTGC GCCCGGTGGA CAAGCCGCTG CGCCTGCCCC 550 

TGCAGGACGT GTACAAGATC GGCGGTATCG GGACGGTGCC CGTGGGGCGC 600 

35 GTGGAGACCG GCATCATGAA GCCGGGCGAC GTGGTGACGT TCGCGCCCGC 650 

CAACGTGACG ACTGAGGTGA AGTCGATCGA GATGCACCAC GAGCAGCTGG 700 

CGGAGGCGCA GCCCGGCGAC AACGTCGGCT TCAACGTGAA GAACGTGTCG 750 

GTGAAGGACA TCCGCCGTGG TAACGTGTGC GGCAACTCGA AGAACGACCC 800 

GCCGAAGGAG GCGGCCGACT TCACGGCGCA GGTGATCGTG CTGAACCACC 850 

40 CCGGCCAGAT CAGCAACGGC TACGCGCCGG TGCTGGACTG CCACACGAGC 900 

CACATTGCGT GCCGCTTCGC GGAAATCGAG TCCAAGATCG ACCGCCGCTC 950 

CGGCAAGGAG CTGGAGAAGA ACCCCAAGGC GATCAAGTCT GGCGATGCCG 1000 

CGATCGTGAA GATGGTGCCG CAGAAGCCGA TGTGCGTGGA GGTGTTCAAC 1050 

GACTACGCGC CGCTGGGCCG CTTTGCCGTG CGCGACATGC GCCAAACCGT 1100 

45 TG 1102 



2) INFORMATION FOR SEQ ID NO: 527 



50 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1105 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii)MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Neospora caninum 

325 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 527 

5 GGACRAACTT AAAGCTGAAC GTGAGCGTGG TATCACCATT GATATCTCCC 50 

TGTGGAAATT TGAGACCAGC AAGTACTATG TTACCATCAT TGATGCCCCA 100 

GGACACAGAG ACTTCATCAA AAACATGATT ACAGGCACAT CCCAGGCTGA 150 

CTGTGCTGTC CTGATTGTTG CTGCTGGTGT TGGTGAATTT GAAGCCGGTA 200 

TCTCCAAGAA CGGGCAGACC CGTGAGCATG CCCTTNTGGC TTACACCCTG 250 

10 GGTGTGAAAC AACTAATTGT TGGCGTTAAC AAAAKGGATT CCACTGAGCC 300 

ACCCTATAGC CARAAGAGAT ACGARGAAAT TGTTAAGGAA GTCAGCMCCT 350 

AYNTTAAAAA AATTGGYTAC AACCCCGACA CAGTANCATT TGKGCCAATT 400 

TNTGGCTGGA ATGGTGACAA CATGCTGGAN CCAAGTGCTA ATATGCCATG 450 

GTTCAAGGGA TGGAAAGTCM CCCGTAAGGA CGGCAATGCC AGKGGAACCM 500 

15 CCCTGCTTGA AGCTYTGGAT TGCATTYTGC CACCAAYTTG CCCAACTGAC 550 

AAACCCTTGC GTTTGCCTYT CCAGGATGTC TATAAAATTG GKGGTATTGG 600 

TACTGTCCCT GTGGGTCGTG TGGAGACTGG TGTTCTCAAA CCTGGCATGG 650 

TGGTCACCTT TGCTCCAGTC AATGTAACAA CTGAAGTGAA GTCTGTAGAA 700 

ATGCACCATG AAGCATTGAG TGAAGCCCTT CCTGGGGACA ATGTGGGCTT 750 

20 CAATGTCAAG AACGTGTCTG TCAAAGATGT CCGTCGTGGC AATGTGGCTG 800 

GTGACAGCAA AAATGATCCA CCCATGGAAG CTGCTGGCTT CACAGCTCAG 850 

GTGATTATTT TGAACCATCC AGGCCAAATC AGTGCTGGAT ATGCACCTGT 900 

GCTGGATTGT CACACAGCTC ACATTGCTTG CAAGTTTGCT GAGCTGAAGG 950 

AGAAGATTGA TCGTCGTTCT GGGAAAAAGC TGGAAGATGG CCCTAAATTC 1000 

25 TTGAAATCTG GTGACGCTGC CATCGTTGAT ATGGTTCCTG GCAAGCCCAT 1050 

GTGTGTCGAG AGCTTCTCTG ATTATCCTCC CCTGGGCCGT TTTGCTGTGC 1100 

GTGAC 1105 



30 

2) INFORMATION FOR' SEQ ID NO: 528 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 935 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichomonas vaginalis 

(B) STRAIN: ATCC 30001 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 528 

GCACATCCCA GGCTGATGCT GCTATCCTTG TCATCGACTC CACACTCGGT 50 

GGYTTCGAAG CCGGTATCGC TGAACAAGGC CAGACACGTG AACACGCTCT 100 

TCTTGCCTTC ACACTCGGCA TCAAGCAGGT CATTGTCGCC GTCAACAAGA 150 

50 TGGATGACAA GACAGTCAAC TACAACAAGG CYCGTTTCGA CGAAATCACA 200 

GCCGAAATGA CACGCATCCT TACAGGCATC GGCTACAAGC CAGAAATGTT 250 

CCGCTTCGTC CCAATCTCCG GCTGGGCTGG CGACAACATG ACAGAGAAGT 300 

CTCCAAACAT GCCATGGTAC AATGGCCCAT ACCTTCTTGA AGCCCTCGAT 350 

TCCCTTCAGC CACCAAAGCG CCCATTCGAC AAGCCACTCC GTCTTCCACT 400 

55 CCAGGATGTC TACAAGATCA ACGGTATCGG TACAGTTCCA GTCGGCCGTG 450 

TCGAATCCGG CACAATGAAG CCAGGCATGA TCGTTAACTT CGCCCCATCC 500 

ACAGTTACAG CTGAAGTTAA GTCCATCGAA ATGCACCACG AATCCCTTCC 550 

AGAGGCTCTT CCAGGTGACA ACATCGGCTT CAACGTCAAG AACGTTTCCA 600 

CAGCTGATGT CAAGCGTGGC TACGTCGTTG GTGATACAAA GCGTGACCCA 650 

60 CCAGTCGAAT GCGCTTCCTT CACAGCTCAR ATGATCATCT CCAACCACCC 700 
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GGCAAGAAGG CTACAGAGAA CCCAGAATAC 
TATCGTCGAG GTTGTCCCAT CCAAGCCACT 
5 AGTACCCACC ACTCGGCCGT TTCGCCATCC 
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TTTCGACTGC CACAOAGCTC 750 J 

AGCGTATCGA TCGTCGQCAC 800/ 

ATTCAGAAGG ATGATGCCGC 850 

CGTCGTCGAG TCCTTCCAGG 900 

GTGAT 935 



2) INFORMATION FOR SEQ ID NO: 529 



10 



( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 1065 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Trypanosoma, brucei subsp. brucei 

(B) STRAIN: EATR07 95 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 529 

25 AAGCTGAAGG CTGAGCGCGA ACGTGGTATC ACGATCGACA TTGCACTGTG 50 

GAAATTCGAG TCACCCAAGT CTGTCTTCAC TATTATTGAT GCTCCTGGGC 100 

ACCGTGACTT CATCAAGAAC ATGATCACCG GCACATCGCA AGCCGACGCA 150 

GCCATCCTCA TCATTGCCTC TGCGCAGGGT GAGTTCGAGG CTGGTATCTC 2 00 

CAAGGATGGA CAGACCCGCG AGCACGCGTT GCTGGCCTTC ACTTTGGGTG 250 

30 TGAAGCAGAT GGTTGTGTGC TGCAACAAGA TGGACGACAA GACTGTGAAC 300 

TACGGACAGG AGCGGTATGA CGAGATTGTG AAGGAGGTGT CTGCTTACAT 350 

CAAGAAGGTT GGGTACAACG TGGAGAAGGT GCGCTTCGTC CCCATCTCCG 400 

GATGGCAGGG CGACAACATG ATTGAGAAAT CCGAGAAGAT GCCATGGTAC 450 

AAGGGTCCAA CGCTCCTGGA GGCACTAGAC ATGCTGGAGC CACCAGTGCG 500 

35 TCCGAGCGAC AAGCCCCTGC GTCTGCCACT GCAGGACGTG TACAAGATCG 550 

GTGGTATTGG CACCGTGCCC GTTGGTCGTG TGGAGACCGG CGTGATGAAG 600 

CCTGGTGATG TGGTGACGTT TGCCCCCGCC AACGTGACGA CCGAGGTGAA 650 

ATCGATCGAG ATGCACCACG AGCAGCTCGC TGAGGCGACC CCCGGTGACA 700 

ACGTCGGCTT TAACGTGAAG AACGTTTCTG TAAAGGACAT CCGCCGTGGC 750 

40 AACGTCTGCG GTAACACCAA GAACGACCCC CCAAAGGAGG CCGCCGACTT 800 

CACGGCACAG GTGAT CATC C TGAACCACCC CGGACAGATT GGAAACGGTT 850 

ATGCGCCCGT GCTGGACTGC CACACATCGC ACATTGCCTG CAAGTTCGCG 900 

GAGATCGAGT CGAAGATCGA CCGTCGCTCT GGCAAGGAGC TGGAGAAGGC 950 

TCCCAAGTCG ATCAAGTCTG GCGACGCCGC GATCGTGCGC ATGGTGCCGC 1000 

45 AGAAGCCTAT GTGCGTGGAG GTCTTCAACG ACTACGCGCC ACTCGGCCGC 1050 

TTTGCCGTGC GTGAC 1065 



50 2) INFORMATION FOR SEQ ID NO: 530 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1297 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Crithiaxa rasciculaza 

(B) STRAIN: ATCC 11745 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 530 

5 

TTCGCGGAGG GCGTGCCGCC GGTGCTGACG GCGCTGGACG TGACGGAGGA 50 

CCTCGGCCGC GACGAGCCGC TGACGCTGGA GATTGTGCAG CACTTGGACG 100 

CGAACACCGG CCGCTGCATT GCCATGCAGA CGACGGATCT GCTGAAGCTG 150 

AAGTCGAAGG TTGTGTCGAC GGGCGGCAAC ATCTCCGTGC CGGTTGGCCG 200 

10 CGAGACGCTG GGCCGCATCT TCAACGTGCT CGGCGACGCG ATCGACCAGC 250 

GCGGTGTGGT GGGCGAGAAG ATGCGCATGC CGATCCACGC CGAGGCGCCG 3 00 

AAGCTGGCGG ACCAGGCCGC GGAGGACGCG ATTCTGACGA CCGGCATCAA 350 

GGTGATCGAC CTGATTCTGC CGTACTGCAA GGGTGGCAAG ATCGGGCTGT 4 00 

TCGGCGGTGC TGGTGTGGGC AAGACTGTGA TCATCATGGA GCTGATCAAC 450 

15 AACGTGGCCA AGGGCCACGG TGGTTTCTCC GTGTTCGCCG GCGTTGGCGA 500 

GCGCACCCGC GAGGGCACGG ATCTGTACCT GGAGATGATG CAGTCGAAGG 550 

TCATTGACCT GAAGGGCGAG TCGAAGTGCG TGCTGGTGTA CGGCCAGATG 600 

AACGAGCCCC CGGGTGCGCG TGCGCGTGTT GCGCAGTCTG CGCTGACGAT 650 

GGCGGAGTAC TTCCGTGACG TGGAGGGCCA GAACGTGCTG CTGTTCATCG 700 

20 ACAACATCTT CCGCTTCACC CAGGCCAACT CCGAGGTGTC CGCCCTGCTG 750 

GGCCGCATTC CCGCCGCCGT GGGCTACCAG CCGACGCTTG CCGAGGATCT 800 

TGGTATGCTG CAGGAGCGCA TTACGTCGAC GACGAAGGGC TCGATTACGT 850 

CTGTGCAGGC CGTGTACGTG CCGGCCGATG ATATCACGGA TCCGGCGCCG 900 

GCGACGACCT TCTCGCACCT GGATGCGACG ACGGTGCTGG ACCGCGCGGT 950 

25 TGCCGAGTCT GGCATCTACC CCGCCGTGAA CCCGCTGGAG TGCGCGTCGC 1000 

GTATCATGGA CCCCGATGTG ATCGACGTGG ACC AC TACAA CGTTGCGCAG 1050 

GATATCGTGC AGATGCTGAC CAAGTACAAG GAGCTGCAGG ATATCATTGC 1100 

CGTGCTGGGT ATCGACGAGC TGAGCGAGGA GGACAAGCTT GTGGTGGACC .1150 

GCGCTCGCAA GGTGACGCGC TTCCTGTCGC AGCCGTTCCA GGTGGCCGAG 1200 

30 GTGTTCACCG GCATGACGGG CCACTACGTG CAGCTGGAGG ACACAGTGGA 1250 

= GTCGTTCTCT GGCCTGCTGA TGGGCTCGTA CGACCAGATC CCGGAGA 1297 



35 2) INFORMATION FOR SEQ ID NO: 531 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1298 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 30816 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 531 

50 

CTTCTCGGAG GGCGTGCCGC CCGTGCTGAC GGCGCTGGAT GTGACGGAGG 50 

ACCTTGGCCG CGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAC 100 

GCGAACACG(3 GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

GAAGTCGAAG GTCGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC 200 

55 GTGAGACGCT GGGCCGCATC TTCAAYGTTC TGGGCGACGC GATCGACCAG 250 

CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC 300 

GAAGCTGGCG GATCAGGCCG CGGAGGACAC GATCCTGACG ACCGGCATCA 350 

AGGTGATCGA CCTGATTCTG CCCTACTGCA AGGGTGGCAA GATCGGCCTG 4 00 

TTCGGCGGTG CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA 450 

60 CAACGTCGCG AAGGGCCACG GCGGTTTCTC CGTGTTTGCC GGCGTTGGCG 500 

328 
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AGCGCACGCG CGAGGGCACG GACCTGTACC TGGAGATGAT GCAGT&3AAG 5§j6 

GTGATTGACC TGAAGGGCGA GTCGAAGTGY GTGCTTGTGT ATGGGCAGAT 600 

GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

TGGCGGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 700 

5 GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT 750 

GGGCCGCATT CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCGGAGGATC 800 

TTGGTATGCT GCAGGAGCGC ATCACGTCGA CAACGAAGGG GTCGATCACG 850 

TCCGTGCAGG CCGTGTACGT GCCAGCGGAT GATATCACGG ATCCCGCGCC 900 

CGCGACGACG TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGCGCGG 950 

10 TGGCGGAGTC GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG 1000 

CGTATCATGG ACCGTGATGT GATCGATGTG GACCACTACA ACGTTGCGCA 1050 

GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG 1100 

CGGTGCTTGG CATCGACGAG CTGAGCGAGG AAGACAAGGT TGTTGTGGAC 1150 

CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 12 00 

15 GGTGTTCACG GGCATGACGG GCCACTACGT GCAGCTGGTC GACACGGTGG 1250 

AGTCGTTCTC TGGCCTGCTG ATGGGGTCGT ACGACCAGAT CCCGGAGA 12 98 



20 2) INFORMATION FOR SEQ ID NO: 532 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1297 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

' (A) ORGANISM: Leishmania aethiopica 

(B) STRAIN: ATCC 50119 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 532 



35 



TTCTCGGAGG GCGTGCCGCC CGTGCTGACG GCGCTGGATG TGACGGAGGA 50 

CCTTGGCCGC GATGAGCCGC TGACGCTGGA GATCGTGCAG CACTTGGACG 100 

CGAACACCGG CCGCTGCATT GCGATGCAGA CGACGGACCT GCTGAAGCTG 150 

AAGTCGAAGG TTGTGTCGAC CGGCGGCAAC ATCTCTGTGC CGGTGGGCCG 2 00 

40 TGAGACGCTG GGCCGCATCT TCAACGTTCT GGGCGACGCG ATCGACCAGC 250 

GCGGCCCCGT GGGCGAGAAG ATGCGCATGG CGATCCACGC CGAGGCCCCA 300 

AAGCTGGCGG ATCAGGCCGC GGAGGACACG ATCCTGACGA CCGGCATCAA 350 

GGTGATCGAC CTGATTCTGC CCTACTGCAA GGGTGGCAAG ATCGGCCTGT 400 

TCGGCGGTGC CGGTGTGGGC AAGACTGTGA TCATCATGGA GCTGATCAAC 450 

45 AACGTCGCGA AGGGCCACGG TGGTTTCTCC GTGTTTGCCG GCGTTGGCGA 500 

GCGCACGCGC GAGGGCACGG ACCTGTACCT GGAGATGATG CAGTCGAAGG 550 

TGATTGACCT GAAGGGCGAG TCGAAGTGCG TGCTTGTGTA CGGGCAGATG 600 

AACGAGCCCC CGGGTGCGCG CGCGCGCGTT GCGCAGTCTG CGCTGACGAT 650 

GGCGGAGTAC TTCCGCGACG TGGAGGGCCA GAACGTGCTG CTGTTCATCG 700 

50 ACAACATCTT CCGCTTCACG CAGGCGAACT CCGAGGTGTC TGCGCTGCTG 750 

GGCCGCATTC CAGCCGCCGT GGGCTACCAG CCGACGCTTG CGGAGGATCT 800 

TGGTATGCTG CAGGAGCGCA TCACGTCGAC AACGAAGGGG TCGATCACGT 850 

CCGTGCAGGC CGTGTACGTG CCAGCGGATG ATATCACGGA TCCCGCGCCC 900 

GCGACGACGT TCTCGCACCT GGACGCGACG ACTGTGCTGG ACCGCGCGGT 950 

55 GGCGGAGTCG GGCATCTACC CTGCCGTGAA CCCGCTGGAG TGCGCGTCGC 1000 

GTATCATGGA CCCCGACGTG ATCGATGTGG ACCACTACAA CGTTGCGCAG 1050 

GATATCGTGC AGATGCTGAC CAAGTACAAG GAGCTGCAGG ATATCATTGC 1100 

GGTGCTTGGC ATCGACGAGC TGAGCGAGGA AGACAAGGTT GTTGTGGACC 1150 

GCGCGCGCAA GGTGACCCGG TTCCTGTCGC AGCCGTTCCA GGTTGCGGAG 1200 

60 GTGTTCACGG GCATGACGGG CCACTACGTG CAGCTGGTCG ACACGGTGGA 1250 

329 
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GTCGTTCTCT GGCCTGCTGA TGGGGTCGTA CGACCAGATC CCGGA^QA 1297. 



5 



2) INFORMATION FOR SEQ ID NO: 533 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1298 bases 

(B) TYPE: Nucleic acid 



10 



(C) STRANDEDNESS : Double 



(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



15 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania donovani 

(B) STRAIN: ATCC 50212 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 533 



20 



CTTCTCGGAG GGCGTGCCGC CCGTACTGAC GGCGCTGGAT GTGACGGAGG 
ACCTTGGCCG CGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAT 
GCGAACACCG GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 
GAAGTCGAAG GTTGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC 

25 GTGAGACGCT GGGCCGCATC TTCAACGTTC TGGGCGACGC GATCGACCAG 
CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC 
GAAGCTGGCG GACCAGGCCG CGGAGGACAC GATCCTGACG ACCGGCATCA 
AGGTGATCGA CCTCATTCTG CCCTACTGCA AGGGCGGCAA GATCGGCCTG 
TTCGGCGGTG CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA 

3 0 CAACGTCGCG AAGGGCCACG GTGGCTTCTC CGTGTTTGCC GGCGTTGGCG 
AGCGCACGCG CGAGGGCACG GACCTATACC TGGAGATGAT GCAGTCGAAG' 
GTGATTGACC TGAAGGGCGA GTCGAAGTGC GTGCTTGTGT ACGGGCAGAT 
GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 
TGGCGGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 

35 GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT 
GGGCCGCATT CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCCGAGGATC 
TTGGTATGCT GCAGGAGCGC ATCACATCGA CGACGAAGGG GTCGATCACG 
TCCGTGCAGG CCGTGTACGT GCCGGCGGAT GATATCACGG ATCCCGCGCC 
CGCGACGACG TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGCGCGG 

40 TGGCGGAGTC GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG 
CGTATCATGG ACCCCGATGT GATCGATGTG GACCACTACA ACGTTGCGCA 
GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG 
CGGTGCTTGG CATCGACGAG CTGAGCGAGG AGGACAAGGT TGTGGTGGAC 
CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 

45 GGTGTTCACG GGCATGACGG GCCACTACGT GCAGCTGGCC GACACGGTGG 
AGTCGTTCTC TGGCCTGCTG ATGGGGTCGT ACGACCAGAT CCCGGAGA 



50 2) INFORMATION FOR SEQ ID NO: 534 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1250 
1298 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 98 bases 

(B) TYPE: Nucleic acid 



(C) STRANDEDNESS: Double 



(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



60 



(vi) ORIGINAL SOURCE: 



330 
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(A) ORGANISM: Leishmania infantum 

(B) STRAIN: MOU 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 534 

5 

CTTCTCGGAG GGCGTGCCGC CCGTACTGAC GGCGCTGGAT GTGACGGAGG 50 

ACCTTGGCCG CGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAT 100 

GCGAACACCG GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

GAAGTCGAAG GTTGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC 200 

10 GTGAGACGCT GGGCCGCATC TTCAACGTTC TGGGCGACGC GATCGACCAG 250 

CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC 300 

AAAGCTGGCG GACCAGGCCG CGGAGGACAC GATCCTGACG ACCGGCATCA 350 

AGGTGATCGA CCTCATTCTG CCCTACTGCA AGGGCGGCAA GATCGGCCTG 400 

TTCGGCGGTG CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA 450 

15 CAACGTCGCG AAGGGCCACG GTGGCTTCTC CGTGTTTGCC GGCGTTGGCG 500 

AGCGCACGCG CGAGGGCACG GACCTATACC TGGAGATGAT GCAGTCGAAG 550 

GTGATTGACC TGAAGGGCGA GTCGAAGTGC GTGCTTGTGT ACGGGCAGAT 600 

GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

TGGCGGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 700 

20 GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT 750 

GGGCCGCATT CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCCGAGGATC 800 

TTGGTATGCT GCAGGAGCGC ATCACATCGA CGACGAAGGG GTCGATCACG 850 

TCCGTGCAGG CCGTGTACGT GCCGGCGGAT GATATCACTG ATCCCGCGCC 900 

CGCGACGACG TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGCGCGG 950 

25 TGGCGGAGTC GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG 1000 

CGTATCATGG ACCCCGATGT GATCGATGTG GACCACTACA ACGTTGCGCA 1050 

GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG 1100 

CGGTGCTTGG CATCGACGAG CTGAGCGAGG AGGACAAGGT TGTGGTGGAC 1150 

CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 1200 

30 GGTGTTCACG GGCATGACGG GCCACTACGT GCAGCTGGCC GACACGGTGG 1250 

AGTCGTTCTC TGGCCTGCTG ATGGGGTCGT ACGACCAGAT CCCGGAGA 1298 



35 2) INFORMATION FOR SEQ ID NO: 535 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1301 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania gerbilli 

(B) STRAIN: ATCC 50121 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 535 



50 



GCACTTCTCG GAGGGCGTGC CGCCCGTGCT GACGGCGCTG GATGTGACGG 50 

AGGACCTTGG CCGCGATGAG CCGCTGACGC TGGAGATCGT GCAGCACTTG 100 

GACGCGAACA CCGGCCGCTG CATTGCGATG CAGACGACGG ACCTGCTGAA 150 

GCTGAAGTCG AAGGTTGTGT CGACCGGTGG CAACATCTCT GTGCCGGTGG 200 

55 GCCGTGAGAC GCTGGGCCGC ATCTTCAACG TTCTGGGCGA TGCGATCGAC 250 

CAGCGCGGCC CCGTGGGCGA GAAGATGCGC ATGGCGATCC ACGCCGAGGC 300 

CCCGAAGCTG GCGGATCAGG CCGCGGAGGA CACGATCCTG ACGACCGGCA 350 

TCAAGGTGAT CGACCTGATT CTGCCCTACT GCAAGGGTGG CAAGATCGGY 400 

CTGTTCGGCG GTGCCGGTGT GGGCAAGACT GTGATCATCA TGGAGCTGAT 450 

60 CAACAACGTC GCGAAGGGCC ACGGTGGTTT CTCCGTGTTT GCCGGCGTTG 500 

331 
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GCGAGCGCAC GCGCGAGGGC ACGGACCTGT ACCTGGAGAT GATGCAGTCG " ' STsT) 

AAGGTGATTG ACCTGAAGGG CGAGTCGAAG TGCGTGCTTG TGTACGbGCA 60fV 

GATGAACGAG CCCCCGGGTG CGCGCGCGCG CGTTGCGCAG TCTGCGCTGA 650 

CGATGGCGGA GTACTTCCGC GACGTGGAGG GCCAGAACGT GCTGCTGTTC 700 

5 ATCGACAACA TCTTCCGCTT CACGCAGGCG AACTCCGAGG TGTCCGCGCT 750 

GCTGGGCCGC ATTCCGGCCG CCGTGGGCTA CCAGCCGACG CTTGCGGAGG 800 

ATCTTGGTAT GCTGCAGGAG CGCATCACGT CGACAACGAA GGGGTCGATC 850 

ACGTCCGTGC AGGCCGTGTA CGTGCCAGCG GATGATATCA CGGATCCCGC 900 

GCCCGCGACG ACGTTCTCGC ACCTTGACGC GACGACTGTG CTGGACCGCG 950 

10 CGGTGGCGGA GTCGGGCATC TACCCTGCCG TGAACCCGCT GGAGTGCGCG 1000 

TCGCGTATCA TGGACCCCGA TGTGATCGAT GTGGACCACT ACAACGTTGC 1050 

GCAGGATATC GTGCAGATGC TGACCAAGTA CAAGGAGCTG CAGGACATCA 1100 

TTGCGGTGCT TGGCATCGAC GAGCTGAGCG AGGAAGACAA GGTTGTGGTG 1150 

GACCGCGCGC GCAAGGTGAC CCGGTTCCTG TCGCAGCCGT TCCAGGTTGC 1200 

15 GGAGGTGTTC ACGGGCATGA CGGGCCACTA CGTGCAGCTG GTCGACACGG 1250 

TGGAGTCGTT CTCTGGCTTG CTGATGGGGT CGTACGACCA GATCCCGGAG 13 00 

A 1301 



20 



2) INFORMATION FOR SEQ ID NO: 536 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 98 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania hertigi 

(B) STRAIN: ATCC 50125 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 536 

CTTCGCGGAG GGCGTGCCGC CGGTGCTGAC GTCGCTGGAT GTGACGGAGA 50 

ACCTCGGCCG CGATGAGCCG CTGACGCTGG AGATTGTGCA GCACTTGGAC 100 

GCGAACACCG GTCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

4 0 GAAGTCGAAG GTCGTGTCGA CCGGTGGCAA CATCTCTGTG CCTGTTGGCC 200 

GCGAGACGCT GGGTCGCATC TTCAACGTGC TTGGCGATGC GATTGACCAG 250 

CGCGGCCCTG TGGGTGAGAA GATGCGCATG GCGATCCACG CCGAGGCGCC 300 

GAAGCTGGCG GATCAGGCGG CAGAGGACAC GATCCTGACG ACCGGCATCA 350 

AGGTGATCGA TCTTATTCTG CCGTACTGCA AGGGTGGTAA GATCGGTCTG 400 

45 TTCGGTGGTG CCGGTGTAGG CAAGACTGTG ATTATTATGG AGCTGATCAA 450 

TAACGTGGCG AAGGGCCACG GTGGGTTTTC CGTGTTTGCT GGCGTGGGCG 500 

AGCGCACGCG CGAGGGCACT GACCTGTACC TGGAGATGAT GCAGTCGAAG 550 

GTGATTGACC TGAAGGGCGA ATCAAAGTGC GTGCTTGTGT ACGGACAGAT 600 

GAACGAGCCC CCGGGTGCGC GTGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

50 TGGCCGAGTA CTTCCGCGAT GTGGAGGGCC AGAACGTGCT GCTGTTCATT 700 

GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT 750 

GGGTCGCATT CCTGCCGCCG TGGGCTACCA GCCGACGCTT GCGGAGGATC 800 

TGGGCATGCT GCAGGAGCGC ATTACGTCGA CGACGAAGGG CTCGATTACG 850 

TCTGTGCAGG CCGTGTACGT GCCTGCGGAT GATATCACGG ACCCGGCGCC 900 

55. CGCGACGACG TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGTGCGG 950 

TGGCAGAGTC GGGCATTTAC CCTGCGGTGA ACCCGCTGGA GTGCGCGTCG 1000 

CGTATCATGG ACCCCGATGT GATCGATGTG GACCACTACA ACGTTGCGCA 1050 

GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG 1100 

CCGTGCTTGG TATCGACGAG CTGAGCGAGG AGGACAAGGT TGTGGTGGAC 1150 

60 CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 1200 

332 
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GGTGTTCACT GGCATGACGG GTCACTACGT TCAGCTGGAG GACA4GGTGG 12 5 1 
AGTCGTTCTC TGGCCTACTG ATGGGGTCAT ACGACCAGAT CCCGGAGA 12 



2) INFORMATION FOR SEQ ID NO: 537 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 97 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii)MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmanial major 

(B) STRAIN: ATCC 50122 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 537 

CTTCTCGGAG GGCGTGCCGC CCGTGCTGAC GGCGCTGGAT GTGACGGAGG 50 

ACCTTGGCCG TGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAC 100 

GCGAACACCG GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

25 GAAGTCGAAG GTTGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC 200 

GTGAGACGCT GGGCCGCATC TTCAACGTTC TGGGCGATGC GATCGACCAG 250 

CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC 3 00 

GAAGCTGGCG GATCAGGCCG CAGAGGACAC GATCCTGACG ACCGGCATCA 350 

AGGTGATCGA CTTGATCCTG CCCTACTGCA AGGGTGGCAA GATCGGCCTG 400 

30 TTCGGCGGTG CCGGTGTGGG CAAGACTGTG 'ATCATCATGG AGCTGATCAA 4 50 

CAATGTCGCG AAGGGCCACG GTGGTTTCTC CGTGTTTGCC GGCGTTGGCG 500 

AGCGCACGCG . CGAGGGCACG GACCTGTACC TGGAGATGAT GCAGTCGAAG 550 

GTGATTGACC TGAAGGGCGA GTCGAAGTGC GTGCTTGTGT ACGGGCAGAT 600 

GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

35 TGGCGGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 700 

GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CCGCGCTGCT 750 

GGGCCGCATT CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCGGAGGATC 800 

TTGGTATGCT GCAGGAGCGC ATCACGTCGA CAACGAAGGG GTCGATCACG 850 

TCCGTGCAGG CCGTGTACGT GCCAGCGGAT GATATCACGG ATCCCGCGCC 900 

40 CGCGACGACG TTCTCGCACC TGGATGCGAC GACTGTGCTG GACCGCGCGG 950 

TGGCGGAGTC GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG 1000 

CGTATCATGG ACCCCGATGT GATCGATGTG GACCACTACA ACGTTGCGCA 1050 

GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GACATCATTG 1100 

CGGTGCTTGG CATCGACGAG CTGAGCGAGG AAGACAAGGT TGTGGTGGAC 1150 

45 CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 12 00 

GGTGTTCACG GGCATGACGG GCCACTACGT GCAGCTGGTC GACACGGTGG 12 50 

AGTCGTTCTC TGGCCTGCTG ATGGGGTCGT ACGACCAGAT CCCGGAG 12 97 



50 



60 



2) INFORMATION FOR SEQ ID NO: 538 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 12 97 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

333 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania amazonensis 

(B) STRAIN: ATCC 50131 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 538 

TTCTCGGAGG GCGTGCCGCC CGTGCTGACG GCGCTGGATG TGACGGAGGA 50 

CCTTGGCCGC GATGAGCCGC TGACGCTGGA GATCGTGCAG CACCTGGACG 100 

CGAACACCGG CCGCTGCATT GCGATGCAGA CGACGGACCT GTTGAAGCTG 150 

AAGTCGAAGG TTGTGTCGAC CGGCGGCAAC ATCTCTGTGC CGGTGGGCCG 200 

TGAGACGCTG GGCCGCATCT TCAACGTGCT GGGCGACGCG ATCGACCAGC 2 50 

GCGGCCCCGT GGGTGAGAAG ATGCGCATGG CGATCCACGC CGAGGCCCCG 300 

AAGCTGGCGG ATCAGGCCGC GGAGGACACG ATCCTGACGA CCGGCATCAA 350 

GGTGATCGAC CTGATTCTGC CCTACTGCAA GGGTGGCAAG ATCGGCCTGT 400 

TTGGTGGCGC CGGTGTGGGC AAGACCGTGA TCATCATGGA GTTGATTAAC 4 50 

AACGTCGCGA AGGGCCACGG TGGTTTCTCG GTGTTTGCCG GCGTTGGCGA ' 500 

GCGCACGCGC GAGGGCACGG ACCTGTACCT GGAGATGATG CAGTCGAAGG 550 

TGATTGACCT GAAGGGCGAG TCGAAGTGCG TGCTTGTGTA CGGGCAGATG 600 

AACGAGCCCC CGGGTGCGCG CGCGCGCGTT GCGCAGTCTG CGCTGACGAT 650 

GGCGGAGTAC TTCCGAGACG TGGAGGGCCA GAATGTGCTG CTGTTCATCG 700 

ACAACATCTT CCGCTTCACG CAGGCGAACT CCGAGGTGTC TGCGCTGCTG 750 

GGCCGCATTC CGGCCGCCGT GGGCTACCAG CCGACGCTTG CGGAGGATCT 800 

TGGTATGCTG CAGGAGCGCA TCACGTCGAC GACGAAGGGG TCGATCACGT 850 

CCGTGCAGGC CGTGTACGTG CCTGCGGATG ATATCACGGA TCCGGCGCCC 900 

GCGACGACGT TCTCGCACCT GGACGCGACG ACTGTGCTGG ACCGCGCGGT 950 

GGCGGAGTCG GGGATCTACC CTGCCGTGAA CCCGCTGGAG TGCGCGTCGC 1000 

GTATCATGGA CCCCGATGTG ATCGACGTGG ACCACTACAA CGTTGCGCAG 1050 

GATATCGTGC AGATGCTGAC CAAGTACAAG GAGCTGCAGG ATATCATTGC 1100 

GGTGCTTGGT ATCGACGAGC TGAGCGAGGA GGACAAGGTC GTGGTGGACC 1150 

GCGCGCGCAA GGTGACCCGG TTCCTGTCGC AGCCGTTCCA GGTTGCGGAG 12 00 

GTGTTCACGG GCATGACGGG CCACTACGTG CAGCTGGCCG ACACGGTGGA 1250 

GTCGTTCTCT GGGCTGCTGA TGGGGTCGTA CGAC CAGATC CCGGAGA 1297 



2 ) INFORMATION FOR SEQ ID NO: 539 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 539 
CCITACATCC TBGTYGCICT IAACAAG 



2) INFORMATION FOR SEQ ID NO: 540 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA • 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 0 



GGDGCITCYT CRTCGWAITC CTG 



23 



10 



15 



2) INFORMATION FOR SEQ ID NO: 541 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 541 



20 GTKGAAATGT TCCGCAAGCT GCT 



23 



25 



30 



2) INFORMATION FOR SEQ ID NO: 542 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) . TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 542 



CGGAARTAGA ACTGSGGACG GTAG 



24 



40 2 ) INFORMATION FOR SEQ ID NO: 543 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 543 



ATCTTAGTAG TTTCTGCTGC TGA 



23 



55 



2 ) INFORMATION FOR SEQ ID NO: 544 



60 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 544 
AYGTTGTCGC CMGGCATTMC CAT 



2) INFORMATION FOR SEQ ID NO: 545 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 545 
TACATCCTBG TYGCICTIAA CAAGTG 



2) INFORMATION FOR SEQ ID NO: 546 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 546 
CCRCGICCGG TRATGGTGAA GAT 



2) INFORMATION FOR SEQ ID NO: 547 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 547 
GTACAGTTGC TTCAGGACGT ATC 



2) INFORMATION FOR SEQ ID NO: 548 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 548 
ACGTTCGATT TCATCACGTT G 



2) INFORMATION FOR SEQ ID NO: 549 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 549 
GAACGTGATA CTGACAAACC TTTA 



2 ) INFORMATION FOR SEQ ID NO: 550 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 550 
GAAGAAGAAC ACCAACGTTG 



2 ) INFORMATION FOR SEQ ID NO: 551 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 551 
GAAGAAAAAA TCTTCGAACT GGCTA 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 555 
5 TCTCGAACTT TCTCTATGTA TGCA 24 



2) INFORMATION FOR SEQ ID NO: 556 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 556 

20 

CGGCGCNATC YTSGTTGTTG C 21 



25 2) INFORMATION FOR SEQ ID NO: 557 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 557 

CCMAGGCATR ACCATCTCGG TG 22 



40 



50 



55 



2) INFORMATION FOR SEQ ID NO: 558 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 558 
TCITTYAART AYGCITGGGT 20 



2 ) INFORMATION FOR SEQ ID NO: 559 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 23 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 559 
CCGACRGCRA YIGTYTGICK CAT 



2 ) INFORMATION FOR SEQ ID NO: 560 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 560 
GAYTTCATYA ARAAYATGAT YAC 



2) INFORMATION FOR SEQ ID NO: 561 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 561 
ACIGTICGGC CRCCCTCACG GAT 



2) INFORMATION FOR SEQ ID NO: 562 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 562 
CARATGRAYG ARCCICCIGG IGYIMGIATG 
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15 



50 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 563 
GGYTGRTAIC CIACIGCIGA IGGCAT 26 



2 ) INFORMATION FOR SEQ ID NO: 564 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 29 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 564 
TAYGGICARA TGAAYGARCC ICCIGGIAA 29 

30 

2) INFORMATION FOR SEQ ID NO: 565 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 565 
45 GGYTGRTAIC CIACIGCIGA IGGDAT 26 



2) INFORMATION FOR SEQ ID NO: 566 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 566 

60 
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5 2) INFORMATION FOR SEQ ID NO: 567 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 567 

TCRTCIGCIG GIACRTAIAY IGCYTG - 26 



20. 



30 



35 



2) INFORMATION FOR SEQ ID NO: 568 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 568' 
RTIATIGGIG CIGTIRTIGA YGT 23 



2) INFORMATION FOR SEQ ID NO: 569 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5.69 
RTIRTIGGIS CIGTIRTIGA TAT 23 

50 



2) INFORMATION FOR SEQ ID NO: 570 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

60 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 570 
RTIRYIGGIC CIGTIRTIGA YGT 



2) INFORMATION FOR SEQ ID NO: 571 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 571 
RTIRTIGGIC CIGTIRTIGA TGT 



2) INFORMATION FOR SEQ ID NO: 572 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 572 
RTIRTIGGIS CIGTIRTIGA 



2) INFORMATION FOR SEQ ID NO: 573 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 573 

a 

CCICCIACCA TRTARAAIGC 



2) INFORMATION FOR SEQ ID NO: 574 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 574 
ATIGCIATGG AYGGIACIGA RGG 



2) INFORMATION FOR SEQ ID NO: 575 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 575 
TIACCATTTC AGTACCTTCT GGTAA 



2) INFORMATION FOR SEQ ID NO: 576 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 576 
AACTTCRTCA AGAAGGTYGG TTACAA 



2) INFORMATION FOR SEQ ID NO: 577 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 577 
CATGATTGAA CCATCCACCA 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 578 
CATGATTGAA GCTTCCACCA 



2) INFORMATION FOR SEQ ID NO: 579 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 579 
GAAGGCCGTG CTGGTGAGAA 



2) INFORMATION FOR SEQ ID NO: 580 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 580 
GCTAAACCAG CTACAATCAC TCCAC 



2 ) INFORMATION FOR SEQ ID NO: 581 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 581 
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ACATCGGTGC ATTATTACGT GG 



2 ) INFORMATION FOR SEQ ID NO: 582 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 582 
TTTCAACTTC GTCGTTGACA CGAACAGT 



2) INFORMATION FOR SEQ ID NO: 583 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 583 
CAACTGCTTT TTGGATATCT TCTTTAATAC CAACG 



2) INFORMATION FOR SEQ ID NO: 584 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 584 
ACATGACACA TCTAAAACAA 



2) INFORMATION FOR SEQ ID NO: 585 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 585 
5 ACCACATACT GAATTCAAAG 2 0 



2) INFORMATION FOR SEQ ID NO: 586 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 586 

20 

CAGAAGTATA CGTATTATCA 2 0 



25 2 ) INFORMATION FOR SEQ ID NO: 587 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 587 

CGTATTATCA AAAGACGAAG 2 0 



40 



50 



55 



2) INFORMATION FOR SEQ ID NO: 588 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 588 
TCTTCTCAAA CTATCGTCCA 2 0 



2) INFORMATION FOR SEQ ID NO: 589 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 20 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 589 
GCACGAAACT TCTAAAACAA 



2) INFORMATION FOR SEQ ID NO: 590 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 590 
TATACGTATT ATCTAAAGAT 



2) INFORMATION FOR SEQ ID NO: 591 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 591 
TCCTGGTTCT ATTACACCAC 



2) INFORMATION FOR SEQ ID NO: 592 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 592 
CAAAGCTGAA GTATACGTAT 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 593 
TTCACTAACT ATCGCCCACA 2 0 



2) INFORMATION FOR SEQ ID NO: 594 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 594 
ATTGGTATCC ATGACACTTC 2 0 

30 

2) INFORMATION FOR SEQ ID NO: 595 ; 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 595 
4 5 TTAAAGCAGA CGTATACGTT 2 0 



2) INFORMATION FOR SEQ ID NO: 596 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
55 (D) TOPOLOGY: . Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 596 

60 
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5 2) INFORMATION FOR SEQ ID NO: 597 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE : DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 597 

ATTGGTATCA AAGAAACTTC 2 0 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 598 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 598 
AATTACACCT CACACAAAAT .20 



2) INFORMATION FOR SEQ ID NO: 599 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 599 
CGGTGAAGAA ATCGAAATCA 2 0 

50 



2) INFORMATION FOR SEQ ID NO: 600 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

60 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 600 
ATGCAAGAAG AATCAAGCAA 



2) INFORMATION FOR SEQ ID NO: 601 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 601 
GTTTCACGTG ATGATGTACA 



2) INFORMATION FOR SEQ ID NO: 602 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 602 
AAGTTGAAGT TGTTGGTATT 



2) INFORMATION FOR SEQ ID NO: 603 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 603 
GGTATTAAAG ACGAAACATC 



2) INFORMATION FOR SEQ ID NO: 604 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 604 
GGTGATGAAG TAGAAATCGT 



PCT/CA00/01150 
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15 



20 



2) INFORMATION FOR SEQ ID NO: 605 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 605 



2 5 GAAATGTTCC GTAAATTATT 



20 



30 



35 



40 



2 ) INFORMATION FOR SEQ ID NO: 606 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 606 
ATTAGACTAC GCTGAAGCTG 



20 



45 2) INFORMATION FOR SEQ ID NO: 607 



50 



55 



60 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: ATCC 29212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 607 



352 
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PCT/CA00/01150 



CGGAGCTATC 
GTGAACATAT 
TTCTTAAACA 
AGAAATGGAA 
ATGTTCCAGT 
TCTTATGAAG 
CCCAACTCCA 
ACGTATTCTC 
CGTGGTGAAG 
CGAAACATCT 
TAGACTACGC 
GCACGTGAAG 
CACTCCACAC 
AAGGCGGACG 
TTCCGTACAA 
AATGGTAATG 
CAATCGCTAT 



TTAGTAGTTT 
CTTATTATCA 
AAATGGATAT 
GTTCGTGACT 
TATCGCAGGT 
AAAAAATCTT 
GAACGTGATA 
AATCACTGGA 
TTCGCGTTGG 
AAAACAACTG 
TGAAGCAGGC 
ATATCGAACG 
ACAAAATTCA 
TCACACTCCA 
CAGACGTTAC 
CCTGGTGATA 
CGAAGACGGA 



CTGCTGCTGA 
CGTAACGTTG 
GGTTGATGAC 
TATTATCAGA 
TCTGCTTTGA 
AGAATTAATG 
CTGACAAACC 
CGTGGTACTG 
TGACGAAGTT 
TTACAGGTGT 
GACAACTTCG 
TGGACAAGTA 
AAGCTGAAGT 
TTCTTCACTA 
TGGTGTTGTA 
ACGTTGCTAT 
A 



TGGTCCTATG CCTCAAACAC 
GTGTACCATA CATCGTTGTA 
GAAGAATTAT TAGAATTAGT 
ATACGATTTC CCAGGCGATG 
AAGCTTTAGA AGGCGACGAG 
GCTGCAGTTG ACGAATATAT 
ATTCATGATG CCAGTCGAAG 
TTGCTACAGG ACGTGTTGAA 
GAAATCGTTG GTATTAAAGA 
TGAAATGTTC CGTAAATTAT 
GTGCTTTATT ACGTGGTGTA 
TTAGCTAAAC CAGCTACAAT 
ATACGTATTA TCAAAAGAAG 
ACTACCGTCC TCAATTCTAC 
GAATTGCCAG AAGGTACTGA 
GGACGTTGAA TTAATTCACC 



5tf 

100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
821 



2 ) INFORMATION FOR SEQ ID NO: 608 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: ATCC 19434 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 608 

CGGAGCTATC TTGGTAGTTT CTGCTGCTGA CGGCCCAATG CCTCAAACTC 50 

GTGAACACAT CCTATTGTCT CGTCAAGTTG GTGTTCCTTA CATCGTTGTA 100 

TTCTTGAACA AAGTAGACAT GGTTGATGAC GAAGAATTAC TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACC TATTAACAGA ATACRAATTC CCTGGTGRCG 200 

ATGTTCCTGT AGTTGCTGGA TCAGCTTTGA AAGCTCTAGA AGGCGACGCT 250 

TCATACGAAG AAAAAATTCT TGAATTAATG GCTGCAGTTG ACGAATACAT 300 

CCCAACTCCA GAACGTGACA ACGACAAACC ATTCATGATG CCAGTTGAAG 350 

ACGTGTTCTC AATTACTGGA CGTGGTACTG TTGCTACAGG TCGTGTTGAA 400 

CGTGGACAAG TTCGCGTTGG TGACGAAGTT GAAGTTGTTG GTATTGCTGA 450 

AGAAACTTCA AAAACAACAG TTACTGGTGT TGAAATGTTC CGTAAATTGT 500 

TAGACYACGC TGAAGCTGGA GACRACATTG GTGCTTTACT ACGTGGTGTT 550 

GCACGTGAAG ACATCCAACG TGGACAAGTT TTAGCTAAAC CAGGTACAAT 600 

CACACCTCRT ACAAAATTCT CTGCAGAAGT ATACGTGTTG ACAAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCACTA ACTACCGTCC ACAATTCTAC 700 

TTCCGTACAA CTGACGTAAC AGGTGTTGTT GAATTACCAG AAGGAACTGA 750 

A 751 



2) INFORMATION FOR SEQ ID NO: 609 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: ATCC 4 9573 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 609 

CGGTGCGATC TTAGTAGTAT CTGCTGCTGA CGGTCCTATG CCTCAAACTC 50 

GTGAACACAT CTTGTTATCA CGTAACGTTG GCGWACCATA CATCGTTGTT 100 

TTCTTGAACA AAATGGATAT GGTTGAYGAC GAAGAATTGC TAGAATTAGT 150 

15 TGAAATGGAA GTTCGTGACC TATTGTCTGA ATATGACTTC CCAGGCGACG 200 

ATGTTCCTGT AATCGCCGGT TCTGCTTTGA AAGCTCTTGA AGGAGATCCT 250 

TCATACGAAG AAAAAATCAT GGAATTGATG GCTGCAGTTG ACGAATACGT 300 

TCCAACTCCA GAACGTGATA CTGACAAACC ATT CATGATG CCAGTCGAAG 350 

ACGTATTCTC AATCACTGGA CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400 

20 CGTGGACAAG TTCGCGTTGG TGATGAAGTA GAAATCGTTG GTATTGCTGA 450 

CGAAACTGCT AAAACAACTG TAACAGGTGT TGAAATGTTC CGTAAATTGT 500 

TAGACTATGC TGAAGCAGGG GATAACATTG GTGCATTGCT ACGTGGGGTT 550 

GCTCGTGAAG ACATCCAACG TGGACAAGTA TTGGCTAAAG CTGGTACRAT 600 

CACACCTCAT ACAAAATTCA AAGCTGAAGT TTATGTTTTG ACAAAAGAAG 650 

25 AAGGTGGRCG TCACACTCCA TTCTTCACTA ACTACCGTCC TCAGTTCTAC 700 

TTCCGTACAA CTGACGTAAC TGGTGTTGTT GAATTACCAG AAGGAACTGA 750 

A 751 



30 



2) INFORMATION FOR SEQ ID NO: 610 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 891 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



45 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Haemophilus influenzae Rd 

(B) STRAIN: KW2 0 

(C) ACCESSION NUMBER: extracted from U32739 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 610 



AATATGATTA CTGGTGCGGC ACAAATGGAT GGTGCTATTT TAGTAGTAGC 50 

AGCAACAGAT GGTCCTATGC CACAAACTCG TGAACACATC TTATTAGGTC 100 

50 GCCAAGTAGG TGTTCCATAC ATCATCGTAT TCTTAAACAA ATGCGACATG 150 

GTAGATGACG AAGAGTTATT AGAATTAGTC GAAATGGAAG TTCGTGAACT 200 

TCTATCTCAA TATGACTTCC CAGGTGACGA TACACCAATC GTACGTGGTT 250 

• CAGCATTACA AGCGTTAAAC GGCGTAGCAG AATGGGAAGA AAAAATCCTT 300 

GAGTTAGCAA ACCACTTAGA TACTTACATC CCAGAACCAG AACGTGCGAT 350 

55 TGACCAACCG TTCCTTCTTC CAATCGAAGA TGTGTTCTCA ATCTCAGGTC 4 00 

GTGGTACTGT AGTAACAGGT CGTGTAGAAC GAGGTATTAT CCGTACAGGT 450 

GATGAAGTAG AAATCGTCGG TATCAAAGAT ACAGCGAAAA CTACTGTAAC 500 

GGGTGTTGAA ATGTTCCGTA AATTACTTGA CGAAGGTCGT GCAGGTGAAA 550 

ACATCGGTGC ATTATTACGT GGTACCAAAC GTGAAGAAAT CGAACGTGGT 600 

60 CAAGTATTAG CGAAACCAGG TTCAATCACA CCACACACTG ACTTCGAATC 650 
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AGAAGTGTAC GTATTATCAA AAGATGAAGG 
TCAAAGGTTA CCGTCCACAA TTCTATTTCC 
ACAATCGAAT TACCAGAAGG CGTGGAAATG 
CAAGATGACA GTAAGCTTAA TCCACCCAAT 
GTTTCGCAAT CCGTGAAGGT GGCCGTACAG 
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TGGTCGTCAT ACTCCA^TCT 700 

GTACAACAGA CGTGACTGGT 750- 

GTAATGCCAG GCGATAACAT 800 

TGCGATGGAT CAAGGTTTAC 850 

TAGGTGCAGG C 891 



2) INFORMATION FOR SEQ ID NO: 611 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: ATCC 14990 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 611 

25 CGGCGGTATC TTAGTTGTAT CTGCTGCTGA CGGTCCAATG CCACAAACTC 50 

GTGAACACAT CTTATTATCA CGTAACGTTG GTGTACCAGC ATTAGTTGTA 100 

TTCTTAAACA AAGTTGACAT GGTAGACGAC GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGACG 2 00 

ATGTACCTGT AATCGCTGGT TCTGCATTAA AAGCATTAGA AGGCGATGCT 2 50 

30 GAATACGAAC AAAAAATCTT AGACTTAATG CAAGCAGTTG ATGATTACAT 3 00 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 4 00 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTATGCACGA 4 50 

AACTTCTAAA ACAAC TGTTA CTGGTGTAGA AATGTTCCGT AAATTATTAG 500 

35 ACTACGCTGA- AGCTGGTGAC AACATCGGTG CTTTATTACG TGGTGTTGCA 550 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTC TATTAC 600 

ACCACACACA AAATTCAAAG CTGAAGTATA CGTATTATCT AAAGATGAAG 650 

GTGGACGTCA CACTCCATTC TTCACTAACT ATCGCCCACA ATTCTATTTC 700 

CGTACTACTG ACGTAACTGG TGTTGTAAAC TTACCAGAAG GTACAGAAAT 750 

40 GGTTATGCCT GGCGACAACG TTGAAATGAC AGTTGAATTA ATCGCTCCAA 800 

TCGCTATCGA AGACGGAA 818 



45 2) INFORMATION FOR SEQ ID NO: 612 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Parathyphi A 

(B) STRAIN: ATCC 9150 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 612 
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GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAfcjACCCG 5* 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT VQO 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150 

5 GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGACTTCC CGGGCGACGA 200 

CACGCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG 250 

AGTGGGAAGC GAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTACATC 3 00 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 350 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC 400 

10 GCGGTATCAT CAAAGTGGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAG 450 

ACTCAGAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCCGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAG 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG 650 

15 CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTACTACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800 

CGCAATGGAC GACGGTCTGC GTTTC 825 



20 



2) INFORMATION FOR SEQ ID NO: 613 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 778 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia ficaria 

(B) STRAIN: ATCC 33105 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 613 



GGCGCTATCC TGGTTGTTGC TGCGACTGAC GGCCCAATGC CTCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GYCAGGTTGG CGTTCCTTTC ATCATCGTRT 100 

4 0 TCATGAACAA ATGCGACATG GTTGATGATG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TTCGCGAACT GCTGTCCGCT TACGACTTCC CTGGCGATGA 200 

CCTGCCGGTG ATTCGCGGTT CCGCGCTGAA AGCGCTGGAA GGCGAAGCCG 250 

AGTGGGAAGC TAAAATCATC GAGCTGGCTG AMCMSCTGGA TACTTACATC 3 00 

CCAGAACCAG AGCGCGCTAT CGACAAGCCG TTCCTGCTGC CAATCGAAGA 350 

45 CGTATTCTCC ATCTCCGGTC GTGGTACCGT GGTTACCGGT CGTGTTGAGC 400 

GCGGTATCAT CAAAGTTGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAC 450 

ACCGTCAAGT CTACCTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCCGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAGC 550 

GTGAAGACAT CGAACGTGGT CAGGTTCTGG CTAAACCAGG TTCCATCAAG 600 

50 CCGCACACCC AGTTCGATTC AGAAGTGTAC ATCCTGAGCA AAGAAGAAGG 650 

TGGTCGTCAC ACKCCATTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

GTACTACTGA CGTGACCGGT ACCATCGAAC TGCCAGAAGG CGTTGAGATG 750 

GTAATGCCTG GCGACAACGT GAACATGA 778 

55 

2) INFORMATION FOR SEQ ID NO: 614 

(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 653 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus malodoratus 

(B) STRAIN: ATCC 43197 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 614 

GTGCGATCTT AGTAGTATCA GCTACTGATG GTCCAATGCC TCAAACTCGT 50 

GAACACATTT TGTTATCACG TCAAGTTGGT GTTAAGCACT TGATCGTTTT 100 

15 CTTGAACAAA GTAGATTTAG TTGATGACGA AGAATTGATC GACTTAGTTG 150 

AAATGGAAGT ACGTGAATTA CTTTCTGAAT ATGGTTTCCC AGGTGATGAT 200 

ATTCCAGTGC TTAAAGGTTC TGCTTTGAAA GCATTAGAAG GCGATCCAGA 250 

ACAAGAACAA GTTATTCTTG ATTTGATGGA TACCGTTGAT GAATATATCC 300 

CAACACCTGA ACGTGACAAT GACAAACCGT TCTTGTTACC AGTTGAGGAT 350 

20 GTTTTCTCGA TCACAGGACG TGGTACTGTA GCTTCTGGTC GTATCGACCG 400 

TGGCGAAGTT AAAGTCGGCG ATGAAATTGA AATCATCGGG ATCAAACCTG 450 

AAGTTCAAAA AGCAATCGTT ACTGGACTTG AAATGTTCCG TAAAACATTG 500 

GATTATGGTG AAGCTGGCGA TAACGTTGGG GTTCTATTAC GTGGGATTAC 550 

ACGTGATGAA ATCGAACGTG GCCAAGTATT AGCTAAACCA GGTTCAATCA 600 

25 CACCACATAC TAAGTTCAAA GCCGAAGTAT ATGTGTTGAC GAAAGAAGAA 650 

GGT 653 



30 2) INFORMATION FOR SEQ ID NO: 615 

(i) SEQUENCE CHARACTERISTICS : 
'(A) LENGTH: 829 bases 
(B) <TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus durans 

(B) STRAIN: ATCC 19432 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 615 



CCATTCTAGT TGTATCTGCA ACAGATGGAC CAATGCCACA AACACGTGAA 50 

CATATTTTAT TGTCACGTCA AGTAGGTGTT AAATATTTGA TCGTCTTCTT 100 

GAACAAAATC GACTTAGTAG ATGATGAAGA ATTGATTGAT CTTGTCGAAA 150 

TGGAAGTTCG TGAATTATTA AGCGAATATG GTTTCCCAGG TGACGATACA 200 

50 CCAGTCATCA AAGGTTCAGC ATTAAAAGCT TTACAAGGAG ATCCTGATGC 250 

AGAAGCAGCT ATCATGGAAT TGATGGATAC TGTTGATGAA TATATCCCAA 3 00 

CACCAGAACG TGATACAGAC AAACCATTAT TGTTACCAGT GGAAGATGTC 350 

TTCTCAATCA CAGGTCGTGG GACTGTTGCT TCAGGTCGTA TCGATCGTGG 400 

TGCAGTTCGT GTAGGTGATG AAATCGAAAT CGTCGGTATC AAACCTGAAA 450 

55 CACAAAAAGC TGTTGTAACT GGGGTCGAAA- TGTTCCGCAA GACATTAGAC 500 

TATGGTGAAG CAGGAGATAA CGTTGGGGTA TTGTTACGTG GTATCCAACG 550 

TGAAGATATC GAACGTGGAC AAGTAATCGC AAAACCAGGT TCAATCACAC 600 

CACATACAAA ATTCAAAGCA GAAGTGTACG TATTGACAAA AGAAGAAGGT 650 

GGACGTCATA CACCATTCTT CAATAACTAT CGTCCACAAT TCTACTTCCG 700 

60 TACAACTGAC GTAACTGGAA CAATCGTTTT ACCTGGAGGC ACTGAAATGG 750 
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TTATGCCTGG AGATAACGTA ACGATCGACG TTGAATTGAT CCATC^AGTT 8QO 
GCCATCGAAA ACGGAACAAC TTTCTCTAT 82 9 



2) INFORMATION FOR SEQ ID NO: 616 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 669 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus pseudoavium 

(B) STRAIN: ATCC 49372 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 616 

GGTGCAATTT TAGTAGTATC TGCTACTGAT GGCCCAATGC CACAAACACG 50 

TGAACATATC TTGTTATCAC GTCAAGTAGG GGTTAAACAC TTAATCGTCT 100 

TCTTGAACAA AGTTGATTTA GTTGATGATG AAGAATTGAT CGATTTAGTT 150 

GAAATGGAAG TTCGGGAATT GCTTTCTGAA TATGGTTTCC CAGGCGATGA 2 00 

TATTCCAGTA CTTAAAGGTT CTGCTTTGAA AGCTTTAGAA GGCGATCCTG 250 

AACAAGAACA AGTAATCCTT GACTTGATGG ATACGGTTGA TGAATACATC 3 00 

CCAACGCCTG AACGTGATAC TGACAAACCA TTCTTGTTAC CAGTCGAAGA 350 

TGTCTTCTCA ATCACAGGAC GTGGTACGGT TGCATCTGGT CGTATCGATC 400 

GTGGGGAAGT TAAAGTCGGT GATGAAGTTG AAATCATCGG GATCAAACCT 450 

GAAGTGCAAA AAGCTGTCGT AACTGGACTA GAAATGTTCC GTAAGACATT 500 

GGATTACGGT GAAGCTGGCG ATAACGTTGG GGTTCTATTA CGTGGGATTA 550 

CTCGTGATGA AATCGAACGT GGACAAGTAT TAGCTAAACC AGGTTCAATC 600 

ACTCCACATA CGAAATTCAG TGCAGAAGTT TATGTATTGA CGAAAGAAGA 650 

AGGTGGCCGT CATACGCCA 66 9 



2) INFORMATION FOR SEQ ID NO: 617 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus dispar 

(B) STRAIN: ATCC 51266 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 617 

CGGGGCAATT TTAGTTGTAT CTGCAACTGA TGGCCCAATG CCACAAACAC 50 

GTGAACACAT TTTGTTAGCT CGTCAAGTAG GGGTTAAATA TTTAATCGTC 100 

TTCTTGAACA AAACAGATTT AGTTGATGAT GAAGAATTAT TGGAACTAGT 150 

TGAAATGGAA GTTCGTGAAT TATTAAATGA ATACAATTTC CCTGGCGATG 200 

ATATTCCTGT TATTCGCGGA TCTGCTTTAA AAGCATTAGA AGGCGATCCA 250 

GAACAAGAAG AAGTAATTAT GAACTTGATG GATACTGTGG ATGAATATAT 3 00 
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CCCAACTCCA 
ATGTCTTCAC 
CGTGGTAAAG 
AGAAACACAA 
TGGATTATGG 
ACTCGTGATG 
TACACCGCAT 
AAGGTGGACG 
TTCCGTACAA 
AATGGTAATG 
CAATCGCCGT 
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GAACGTGACA 
AATTACTGGT 
TCAACGTTGG 
AAAGCTGTTG 
TGAAGCTGGT 
AAGTAGAACG 
ACCAAATTTA 
TCATACTCCT 
CTGATGTGAC 
CCTGGTGATA 
TGAAAAAGGG 



ATGATAAACC 
CGTGGTACTG 
TGATGAAATT 
TAACCGGTTT 
GATAACGTTG 
TGGTCAAGTA 
AAGGTGAAGT 
TTCTTTAATA 
TGGTAACATC 
ATGTAACAAT 
ACTACTTTCT 



ATTCTTGTTA CCACAgGAAG 
TTGCTTCAGG TCGTATCGAC 
GAAATTATCG GAATTAAACC 
GGAAATGTTC CGTAAAACTT 
GGGTCTTATT ACGTGGGATT 
TTAGCAAAAC CAGGTTCCAT 
TTATATCTTA ACAAAAGAAG 
ACTATCGTCC TCAATTTTAT 
GCATTACCTG AAGGAACTGA 
TGAAGTTGAA TTGATTCATC 
CAATT 



35Q/ 

400 

450 

500 

550 

600 

650 

700 

750 

800 

835 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 618 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 673 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus avium 

(B) STRAIN: ATCC 14025 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 618 

GGTGCAATCC TAGTAGTATC AGCTACTGAT GGTCCAATGC CGCAAACACG 50: 

TGAACATATT TTGCTATCAC GGCAAGTGGG TGTTAAACAC TTAATCGTAT 100. 

TTTTAAACAA AGTTGATTTA GTCGATGATG AAGAATTGAT CGATCTAGTT 150 

GAAATGGAAG TCCGTGAATT ACTTTCTGAA TATGGTTTCC CAGGTGACGA 200 

TATTCCAGTT CTCAAAGGTT CAGCTTTGAA AGCATTAGAA GGCGATCCTG 250 

AACAAGAACA AGTAATCCTT GATTTAATGG ATACAGTTGA CGAATATATC 3 00 

CCAACTCCAG AACGTGACAC TGACAAGCCA TTCTTGTTAC CAGTCGAAGA 350 

TGTATTTTCT ATCACTGGTC GTGGGACTGT AGCGTCTGGA CGGATTGATC 4 00 

GTGGTGAAGT TAAAGTCGGC GATGAAGTTG AAATCATCGG GATCAAACCT 450 

GAAATTCAAA AAGCAGTCGT AACTGGACTT GAAATGTTCC GTAAAACTTT 500 

AGATTATGGT GAAGCTGGCG ATAACGTTGG GGTTCTATTA CGTGGGATTA 550 

CACGTGATGA AATCGAACGT GGTCAAGTCT TAGCTAAACC AGGTTCAATC 600 

ACACCACATA CAAAATTCAG TGCAGAAGTT TACGTATTGA CGAAAGAAGA 650 

AGGTGGACGT CATACACCAT CTT 673 



2) INFORMATION FOR SEQ ID NO: 619 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1713 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 
(C) ACCESSION NUMBER: K00428 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 619 

TTTCATATTT TTAAGGATTT TGTTTTAGCA CCCATCCGAC CTCAGTCAAT 50 

ATATCCTTTC GCGACCAGGC TTTCCTCCCT TTTGCTGCTA ACTGGTTACA 100 

GATTTTCCTA TTTTTGGTCA TTTTTATCTT TGAAACTGAT TAAGCTGAAA 150 

AAATTTGAGC TTCTTTGTTG TAAACTATTT TGTGCTTTCA GTTTTATTCT 2 00 

AGCTCGACAA AGGTAACAGA CAAAAATGTC AGCTTTATTA CCAAGATTAC 250 

TCACAAGAAC AGCTTTTAAA GCTTCTGGGA AACTTCTGAG GCTCTCTTCA 3 00 

GTAATTTCTA GGACCTTTTC TCAAACTACT ACTTCCTATG CAGCTGCTTT 350 

TGATCGTTCC AAACCGCATG TAAATATAGG TACGATCGGC CATGTTGATC 4 00 

ATGGGAAGAC AACTTTAACC GCAGCCATTA CGAAAACGTT AGCCGCAAAA 450 

GGTGGTGCCA ACTTCTTGGA CTATGCTGCC ATCGATAAGG CTCCGGAAGA 500 

AAGAGCTCGT GGTATTACAA TTTCTACTGC ACACGTGGAA TACGAAACGG 550 

CCAAGAGACA TTATTCTCAC GTCGACTGTC CAGGCCACGC TGATTACATC 600 

AAGAATATGA TTACCGGTGC TGCTCAAATG GATGGTGCTA TCATTGTTGT 650 

AGCTGCTACC GATGGACAAA TGCCCCAAAC TAGAGAACAT TTACTTTTGG 700 

CCAGACAAGT TGGTGTCCAA CATATTGTCG TTTTTGTTAA CAAGGTTGAT 750 

ACCATTGATG ATCCAGAAAT GTTAGAGTTA GTCGAAATGG AAATGAGAGA 800 

ACTTTTAAAC GAATATGGGT TTGACGGTGA TAATGCTCCA ATTATCATGG 850 

GTTCTGCCCT TTGCGCTTTG GAAGGTCGCC AACCTGAAAT TGGGGAGCAG 900 

GCCATCATGA AACTTTTGGA TGCAGTGGAT GAGTATATTC CTACACCTGA 950 

AAGAGATTTG AACAAGCCTT TCTTGATGCC CGTTGAAGAT ATCTTCTCTA 1000 

TCTCCGGTAG AGGTACTGTG GTCACTGGTC GTGTGGAAAG GGGTAATTTA 1050 

AAGAAAGGTG AGGAATTGGA AATTGTTGGT CACAACTCCA CCCCATTGAA 1100 

AACAACAGTT ACTGGTATTG AAATGTTTAG AAAGGAATTG GACTCTGCTA 1150 

TGGCAGGTGA CAATGCCGGT GTTTTACTTA GAGGTATCAG GAGAGATCAA 12 00 

TTGAAGAGAG GTATGGTCTT AGCTAAGCCA GGTACCGTTA AAGCCCATAC 1250 

AAAGATTCTA GCCTCTTTGT ACATTTTATC CAAAGAGGAA GGTGGTAGAC 13 00 

ATTCTGGGTT TGGTGAAAAC TACAGACCAC AAATGTTTAT AAGAACAGCT 1350 

GATGTTACAG TTGTGATGAG ATTTCCTAAG GAGGTTGAAG '* ATCATTCTAT 14 00 

GCAAGTTATG CCAGGTGACA ATGTTGAAAT GGAATGTGAT TTGATCCATC 1450 

CTACCCCATT AGAAGTTGGT CAACGTTTCA ATATCAGAGA ; GGGTGGAAGA 1500 

ACTGTTGGTA CCGGTCTAAT CACACGTATT ATTGAATAGA CTTATTGATG 1550 

CAACTGGAGT ATATTTCTAT ATATTCTGTT CATTTCCCCT CTCATAATAT 1600 

ATACTTGTTT CGTTAAAATT TTATACGTGT AAATAAAGTG CCATAAATTT 1650 

TTCAGCTTTA CTTTTGGTAG AGTCCTGCTA GCACTAGATT TTACAATTTC 1700 

ATGTGCACAC ACC 1713 



2) INFORMATION FOR SEQ ID NO: 620 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 0 
ATTGGTGCAT TGCTACGT 

2) INFORMATION FOR SEQ ID NO: 621 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
10 (B) STRAIN: ATCC 19434 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 621 

TGGTGCAATC TTAGTTGTTT CTGCAACTGA CGGTCCGATG CCTCAAACAC 50 

15 GTGAACACAT TTTATTGTCA CGCCAAGTTG GTGTAAAATA CCTGATTGTT 100 

TTCTTGAACA AAGTTGATTT AGTCGATGAT GAAGAATTGA TCGATTTGGT 150 

AGAAATGGAA GTTCGCGAGT TATTGAGCGA ATATGGTTTC CCAGGCGATG 2 00 

ACACTCCTGT GATCAAAGGT TCCGCATTAA AAGCATTGCA AGGCGATCCA 2 50 

GATGCTGAAG CTGCTATTAT GGAATTGATG GATACAGTAG ATGAATATAT 3 00 

20 CCCAACACCA GAACGTGATA CAGATAAACC ATTACTATTG CCAGTGGAAG 3 50 

ACGTCTTCTC AATTACAGGT CGAGGAACTG TTGCCTCAGG TCGTATTGAT 4 00 

CGTGGTGCTG TTCGTGTCGG TGATGAGGTA GAGATCGTAG GGATCAAACC 450 

TGAAACACAA AAAGCAGTTG TAACAGGTGT AGAAATGTTC CGTAAAACGT 500 

TAGATTACGG GGAAGCTGGG GATAACGTAG GCGTGTTGTT ACGGGGGATC 550 

25 CAACGTGACG ATATCGAACG TGGACAAGTA CTTGCTAAAC CAGGTTCCAT 600 

TACTCCACAT ACAAAATTCA AAGCAGAAGT GTACGTGTTG ACAAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCAACA ACTATCGTCC ACAGTTCTAC 7 00 

TTCCGCACAA CTGATGTTAC AGGAACAATC ACATTGCCAG AAGATACAGA 7 50 
A 

30 



2) INFORMATION FOR SEQ ID NO: 622 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



751 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 
45 (B) STRAIN: ATCC 13264 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 622 

GTCAAATGGG ACGAATC C AG ATTCCAAGAA ATTGTCAAGG AAACCTCCAA 50 

50 CTTTATCAAG AAGGTTGGTT ACAACCCAAA GACTGTTCCA TTCGTCCCAA 100 

TCTCTGGTTG GAACGGTGAC AACATGATTG AAGCTACCAC CAACGCTCCA 150 

TGGTACAAGG GTTGGGAAAA GGAAACCAAG GCCGGTGTCG TCAAGGGTAA 2 00 

GACTTTGTTG GAAGCCATTG ACGCCATTGA ACAACCATCT AGACCAACTG 250 

ACAAGCCATT GAGATTGCCA TTGCAAGATG TTTACAAGAT TGGTGGTATT 3 00 

55 GGTACTGTGC CAGTCGGTAG AGTTGAAACC GGTGTCATCA AGCCAGGTAT 3 50 

GGTTGTTACT TTCGCCCCAG CTGGTGTTAC CACTGAAGTC AAGTCCGTTG 4 00 

AAATGCATCA CGAACAATTG GAACAAGGTG TTCCAGGTGA CAACGTTGGT 4 50 

TTCAACGTCA AGAACGTTTC CGTTAAGGAA ATCAGAAGAG GTAACGTCTG 500 

TGGTGACGCT AAGAACGATC CACCAAAGGG TTGCGCTTCT TTCAACGCTA 550 

60 CCGTCATTGT TTTGAACCAT CCAGGTCAAA TCTCTGCTGG TTACTCTCCA 600 
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GTTTTGGATT GTCACACTGC TCACATTGCT 
GGAAAAGAAC GACAGAAGAT CTGGTAAGAA 
TCTTGAAGTC CGGTGACGCT GCTTTGGTCA 



TGTAGATTCG ACGAATTGTT SSfi 
GTTGGAAGAC CATCCAAAGT 700 
AGTTCGTTCC ATCTAAGCCA 750 



2) INFORMATION FOR SEQ ID NO: 623 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1269 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus neoformans 

(B) STRAIN: ATCC 44104 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 623 

TCTTGAAAGC TTAAGGCCGA GCGAGAGCGA GGTATCACCA TCGACATTGC 50 

TCTTTGGAAG TTCGAGACCC CCAGGTACCA GGTCACCGTC ATTGACGCCC 100 

CCGGTCACCG AGACTTCATC AAGAACATGA TCACCGGTAC CTCCCAGGCT 150 

GACTGTGCCA TCCTCATCAT TGCCACCGGT ATCGGTGAGT TCGAGGCCGG 200 

TATCTCCAAG GACGGTCAGA CCCGAGAGCA CGCCCTCCTC GCCTTCACCC 250 

TCGGTGTCAG GCAGCTCATT GTTGCTTGCA ACAAGATGGA CACCTGCAAG 300 

TGGTCCGAGG ACCGATTCAA CGAAATCGTC AAGGAGACCA ACGGTTTCAT 350 

CAAGAAGGTT GGTTACAACC CCAAGGCTGT CCCCTTCGTC CCCATCTCTG * 400 

GTTGGCACGG TGACAACATG TTGGAGGAGA CCACCAAGTC AGTGAATCCG 450 

CTTCTACGTG ATGAGATGTT TTTCTGACTT TCCGTGCAGC ATGCCCTGGT 500 

ACAAGGGATG GACCAAGGAG ACCAAGTCTG GTGTTTCCAA GGGTAAGACC 550 

CTCCTCGAGG CCATCGACGC CATCGAGCCC CCTACCCGAC CCACCGACAA 600 

GCCCCTCCGT CTCCCTCTCC AGGACGTCTA CAAGATCGGT GGTATCGGCA 650 

CAGTCCCTGT CGGCCGAGTC GAGACCGGTG TCATCAAGGC CGGTATGTTG 700 

TCTCATCTCT CTTGTCTCGT AACATGCGTC TCGTAACATG CGCTTACTTC 750 

ATTTTCAGGT ATGGTCGTCA AGTTCGCCCC CACCAACGTC ACCACTGAAG 800 

TCAAGTCCGT TGAGATGCAC CACGAGCAGA TCCCCGAGGG TCTTCCCGGA 850 

GACAACGTTG GTTTCAACGT CAAGAACGTT TCCATCAAGG ACATCCGACG 900 

AGGTAACGTC TGTGGTGACT CCAAGAACGA CCCCCCTATG GAGGCTGCTT 950 

CTTTCAACGC CCAGGTTATC GTCCTTAACC ACCCTGGTCA GATCGGTGCC 1000 

GGTTACACCC CCGTTCTCGA CTGTCACACT GCCCGTAAGC CTGACCCAAT 1050 

ACCTCCAACA TACCTTTGAA GCTGACCCTT TCTAGACATT GCCTGCAAGT 1100 

TTGCTGAGTT GATCGAGAAG ATTGACCGAC GAACCGGTAA GGTCATGGAG 1150 

GCCGCCCCCA AGTTCGTCAA GTCTGGTGAC GCCGCCATTG TCAAGCTTGT 1200 

TGC CCAGAAG CCCCTCTGTG TTGAGAC CT A CGCCGACTAC CCCCCTCTTG 1250 

GTCGATTCGC CGTCCGAGA 1269 



2) INFORMATION FOR SEQ ID NO: 624 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 753 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 36801 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 624 

TCTGTCAAAT GGGACAAAAA CAGATTTGAA GAAATCATCA AGGAAACCTC 50 

CAACTTCGTC AAGAAGGTTG GTTACAACCC AAAGACTGTT CCATTCGTTC 100 

10 CAATCTCTGG TTGGAATGGT GACAACATGA TTGAACCATC CACCAACTGT 150 

CCATGGTACA AGGGTTGGGA AAAGGAAACC AAATCCGGTA AAGTTACTGG 2 00 

TAAGACCTTG TTAGAAGCTA TTGACGCTAT TGAACCACCA ACCAGACCAA 250 

CCGACAAACC ATTGAGATTG CCATTGCAAG ATGTTTACAA GATCGGTGGT 300 

ATTGGTACTG TGCCAGTCGG TAGAGTTGAA ACTGGTATCA TCAAAGCCGG 3 50 

15 TATGGTTGTT ACTTTCGCCC CAGCTGGTGT TACCACTGAA GTCAAATCCG 400 

TTGAAATGCA TCACGAACAA TTGGCTGAAG GTGTTCCAGG TGACAATGTT 450 

GGTTTCAACG TTAAGAACGT TTCCGTTAAA GAAATTAGAA GAGGTAACGT 500 

TTGTGGTGAC TCCAAGAACG ATCCACCAAA GGGTTGTGAC TCTTTCAATG 550 

CCCAAGTCAT TGTTTTGAAC CATCCAGGTC AAATCTCTGC TGGTTACTCT 600 

20 CCAGTCTTGG ATTGTCACCC TGCCCACATT GCTTGTAAAT TCGACACTTT 650 

GGTTGAAAAG ATTGACAGAA GAACTGGTAA GAAATTGGAA GAAAATCCAA 700 

AATTCGTCAA ATCCGGTGAT GCTGCTATCG TCAAGATGGT CCCAACCAAA 750 

CCA 753 



25 



2) INFORMATION FOR SEQ ID NO: 625 



(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 625 
CGTTGAAGAC ACGACCCAAA GTATCC 26 

40 

2) INFORMATION FOR SEQ ID NO: 626 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 626 
5 5 TACCACCTTT TAAGTAAGGT GCTAAT 2 6 



60 



2) INFORMATION FOR SEQ ID NO: 627 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 627 
ATTGTCTATA AAAATGGCGA TAAGTC 



2) INFORMATION FOR SEQ ID NO: 628 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 628 
AAAATGGCGA TAAGTCACAA AAAGTA 



2) INFORMATION FOR SEQ ID NO: 62 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 9 
AAGTTCCATC TCAACAAGGT CAATA 



2) INFORMATION FOR SEQ ID NO: 630 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 630 
CGGAGCTATC CTAGTCGTTT CA 
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10 



20 



50 



60 



2) INFORMATION FOR SEQ ID NO: 631 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 631 
15 CAGACCAACY GAIAARCCAT TRAGAT 26 



2 ) INFORMATION FOR SEQ ID NO: 632 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 632 

30 

. CCCTTTGGTG GRTCSTKCTT GGA 23 



35 2) INFORMATION FOR SEQ ID NO: 633 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 633 

CAGACCAACY GAIAARCCIT TRAGAT 26 



2) INFORMATION FOR SEQ ID NO: 634 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 6 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 634 

AACACYGTCA GRRCIATTGC YATGGA 



2) INFORMATION FOR SEQ ID NO: 635 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 635 
AAACCRGTIA RRGCRACTCT IGCTCT 



2) INFORMATION FOR SEQ ID NO: 636 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA : 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 636 
ACTGGYGTTG AIATGTTCCG YAA 



2) INFORMATION FOR SEQ ID NO: 637 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 637 
ACGTCAGT I G TACGGAARTA GAA 



2 ) INFORMATION FOR SEQ ID NO: 638 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 638 
CCAATGCCAC AAACI CGTGA RCACAT 2 6 



2) INFORMATION FOR SEQ ID NO: 639 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 28 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 639 
TTTACGGAAC ATTTCWACAC CWGTIACA 2 8 



2) INFORMATION FOR SEQ ID NO: 64 0 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid* 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 640 
TCCATGGTIT WYGGICARAT GAA 23 

40 

2) INFORMATION FOR SEQ ID NO: 641 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 641 
55 TGATAACCWA CIGCIGAIGG CATACG 26 



2) INFORMATION FOR SEQ ID NO: 642 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 642 
GGCGTIGGIG ARCGIACICG TGA 



2) INFORMATION FOR SEQ ID NO: 643 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 643 
ACTGGIGTIG ARATGTTCCG YAA 



2) INFORMATION FOR SEQ ID NO: 644 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 644 
ACGTC I GT I G T I CKGAARTA GAA 



2) INFORMATION FOR SEQ ID NO: 645 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 645 
ACGTC I GT I G TIC KGAARTA RAA 
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2) INFORMATION FOR SEQ ID NO: 646 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 646 
ATCGACAAGC CITTCYTIAT GSC 



2) INFORMATION FOR SEQ ID NO: 647 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 647 
ACGTCCGTSG TRCGGAAGTA GAACTG 



2) INFORMATION FOR SEQ ID NO: 648 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 8 
ACGTCSGTSG TRCGGAAGTA GAACTG 



2) INFORMATION FOR SEQ ID NO: 649 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 649 

GTCCTATGCC TCARACWCGI GAGCAC 



2) INFORMATION FOR SEQ ID NO: 650 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 650 
TTACGGAACA TYTCAACACC IGT 



2) INFORMATION FOR SEQ ID NO: 651 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 651 
TGACGACCAC CITCYTCYTT YTTCA 



2) INFORMATION FOR SEQ ID NO: 652 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 652 
CCWAYAGTIY KICCICCYTC YCTIATA 



2) INFORMATION FOR SEQ ID NO: 653 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 653 
GAYTTCATIA ARAAYATGAT 2 0 



2) INFORMATION FOR SEQ ID NO: 654 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 654 
TACAARAT YK G I GGTATYGG 2 0 



2) INFORMATION FOR SEQ ID NO : 655 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 655 
CCRATACC I C MRATYTTGTA 2 0 

40 

2) INFORMATION FOR SEQ ID NO : 656 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 656 
55 AATTAATGGC TGCAGTTGAY GA 22 



2) INFORMATION FOR SEQ ID NO: 657 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
5 (D) TOPOLOGY: Linear 
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10 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 657 
TTGTCCACGT TCGATRTCTT CA 



22 



15 2) INFORMATION FOR SEQ ID NO: 658 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 658 



GATYTAGTCG ATGATGAAGA ATT 



23 



30 



2) INFORMATION FOR SEQ ID NO: 659 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 659 



GCTTTTTGIG TTTCWGGTTT RAT 



23 



45 



2) INFORMATION FOR SEQ ID NO: 660 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 660 



60 



GTAGAATTGA GGACGGTAGT TAG 



23 
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2 ) INFORMATION FOR SEQ ID NO: 661 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



10 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 661 
15 GTAGAAYTGT GGWCGATART TRT 23 



2) INFORMATION FOR SEQ ID NO: 662 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 
(C> STRANDEDNESS: Double 

25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
3 0 (A) ORGANISM: CoryneJbacteriim? diphtheriae 

(B) STRAIN: ATCC 27010 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 662 

35 CGGCGCAATC CTCGTTGTTG CTGCCACCGA CGGCCCAATG CCTCAGACCC 50 

GTGAGCACGT TCTGCTCGCT CGCCAGGTCG GCGTTCCTTA CATCCTCGTT 100 

GCTCTGAACA AGTGCGACAT GGTTGATGAT GAGGAAATCA TCGAGCTCGT 150 

CGAGATGGAG ATCCRTGAGC TGCTCGCTGA GCAGGATTAC GACGAAGAGG 200 

CTCCAATCAT CCACATCTCC GCACTGAAGG CTCTTGAGGG CGACGAGAAG 250 

40 TGGACCCAGT CCATCATCGA CCTCATGCAG GCTTGCKATG ATTCCATCCC 300 

AGACCCAGAG CGTGAGACCG ACAAGCCATT CCTCATGCCT ATCGAGGACA 350 

TCTTCACCAT CACCGGCCGC GGTACCGTTG TTACCGGCCG TGTTGAGCGT 400 

GGCTCCCTGA AGGTCAACGA GGACGTCGAG ATCATCGGTA TCCGCGAGAA 450 

KGCTACCACC ACCACCGTTA CCGGTATCGA GATGTTCCGT AAGCTTCTCG 500 

45 ACTACACCGA GGCTGGCGAC AACTGTGGTC TGCTTCTCCG TGGCGTTAAG 550 

CGCGAAGACG TTGAGCGTGG CCAGGTTGTT GTTAAGCCAG GCGCTTACAC 600 

CCCTCACACC GAGTTCGAGG GCTCTGTCTA CGTTCTGTCC AAGGACGAGG 650 

GTGGCCGCCA CACCCCATTC TTCGACAACT ACCGCCCACA GTTCTACTTC 700 

CGCACCACCG ACGTTACCGG TGTTGTGAAG CTTCCTGAGG GCACCGAGAT 750 

50 GGTCATGCCT GGCGACAACG TCGACATGTC CGTCACCCTG ATCCAGCCTG 800 

TCGCTATGGA TGAGGGCCTG CGCTTCGCTA TC 832 



55 2) INFORMATION FOR SEQ ID NO: 663 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1192 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 

373 
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(D) TOPOLOGY: Linear 



PCT/CA00/01150 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida catenulata 

(B) STRAIN: ATCC 10565 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 663 



10 



AACGGCGAGC AAGACTTGGT GTTGGAGGTT TCTCAGCACT TGGGTGAGAA 50 

CACCGTGCGT ACCATTGCCA TGGACGGTAC CGAGGGTTTG GTGAGAGGTA 100 

CCGCTGTCAC TGACACCGGT GCTCCCATCT CGGTCCCCGT TGGTCAGGGT 150 

ACCTTGGGCC GGATCATCAA CGTTGTCGGT GAGCCCATCG ACGAGCGTGG 2 00 

15 TCCCATCCAG TGCAAGCAGA GAAACCCCAT TCACGCCGAG CCCCCGTCTT 250 

TCACCGAGCA GTCCGTCGAG GCTGAGGTGT TGGAGACCGG TATCAAGGTT 300 

GTCGACTTGT TGGCTCCCTA CGCCCGTGGT GGTAAGATTG GTCTTTTCGG 350 

TGGTGCCGGT GTCGGTAAGA CCGTGTTCAT CCAGGAGTTG ATTAACAACA 400 

TTGCCAAGGC CCACGGTGGT TTCTCCGTGT TCACTGGTGT CGGTGAGCGT 450 

20 ACTCGTGAGG GTAACGACTT GTACCGTGAA ATGAAGGAGA CCGGTGTCAT 500 

CAACTTGGAG GGCGACTCCA AGGTGGCCTT GGTGTTCGGT CAGATGAACG 550 

AGCCCCCGGG GGCTCGTGCC CGTGTCGCCT TGACCGGTTT GACCATTGCC 600 

GAGTACTTCC GTGACGAGGA AGGCCAGGAC GTGTTGTTGT TCGTTGACAA 650 

CATTTTCAGA TTCACCCAGG CCGGTTCCGA GGTGTCGGCG TTGTTGGGTC 700 

25 GTATCCCCTC CGCCGTCGGT TACCAGCCCA CTTTGGCCAC CGACATGGGT 750 

TTGTTGCAGG AGAGAATTAC CACCACCAAG AAGGGTTCCG TCACCTCTGT 800 

GCAGGCCGTG TACGTCCCTG CCGATGACTT GACTGACCCT GCCCCCGCCA 850 

CCACTTTCGC TCACTTGGAC GCCACCACCG TGTTGTCGCG TGGTATCTCC 900 

GAGTTGGGTA TCTACCCCGC CGTCGACCCC TTGGACTCCA AGTCGAGATT 950 

30 GTTGGACGTC GAGGTTGTTG GCCAGGAGCA CTACGACGTC GCCACCGGTG 1000 

TCCAGGAGTG CTTGCAGGCC TACAAGTCGT TGCAGGACAT CATTGCCATT 1050 

TTGGGTATGG ACGAGTTGTC CGAGCAGGAC AAGTTGACCG TCGAGAGAGC 1100 

CCGTAAGATC CAGCGTTTCT TGTCGCAGCC CTTCGCTGTC GCCGAGGTTT 1150. 

TCACTGGTAT CCCCGGTAGA TTGGTGAGAT TGCAGGACAC CG 1192 

35 

2) INFORMATION FOR SEQ ID NO: 664 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 664 
50 AAYATGAT I A CIGGIGCIGC ICARATGGA 29 



2 ) INFORMATION FOR SEQ ID NO: 665 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1377 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 

374 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(C) ACCESSION NUMBER: extracted from X00779 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 665 

ATGGGTAAAG AGAAGTCTCA CATTAACGTT GTCGTTATCG GTCATGTCGA 50 

TTCTGGTAAG TCTACCACTA CCGGTCATTT GATTTACAAG TGTGGTGGTA 100 

TTGACAAGAG AACCATCGAA AAGTTCGAAA AGGAAGCCGC TGAATTAGGT 150 

AAGGGTTCTT TCAAGTACGC TTGGGTTTTG GACAAGTTAA AGGCTGAAAG 200 

AGAAAGAGGT ATCACTATCG ATATTGCTTT GTGGAAGTTC GAAACTCCAA 250 

AGTACCAAGT TACCGTTATT GATGCTCCAG GTCACAGAGA TTTCATCAAG 300 

AACATGATTA CTGGTACTTC TCAAGCTGAC TGTGCTATCT TGATTATTGC 350 

TGGTGGTGTC GGTGAATTCG AAGCCGGTAT CTCTAAGGAT GGTCAAACCA 400 

GAGAACACGC TTTGTTGGCT TTCACCTTGG GTGTTAGACA ATTGATTGTT 450 

GCTGTCAACA AGATGGACTC CGTCAAATGG GACGAATCCA GATTCCAAGA 500 

AATTGTCAAG GAAACCTCCA ACTTTATCAA GAAGGTTGGT TACAACCCAA 550 

AGACTGTTCC ATTCGTCCCA ATCTCTGGTT GGAACGGTGA CAACATGATT 600 

GAAGCTACCA CCAACGCTCC ATGGTACAAG GGTTGGGAAA AGGAAACCAA 650 

GGCCGGTGTC GTCAAGGGTA AGACTTTGTT GGAAGCCATT GACGCCATTG 700 

AACAACCATC TAGACCAACT GACAAGCCAT TGAGATTGCC ATTGCAAGAT 750 

GTTTACAAGA TTGGTGGTAT TGGTACTGTG CCAGTCGGTA GAGTTGAAAC 800 

CGGTGTCATC AAGC CAGGTA TGGTTGTTAC TTTTGCCCCA GCTGGTGTTA 850 

CCACTGAAGT CAAGTCCGTT GAAATGCATC ACGAACAATT GGAACAAGGT 900 

GTTCCAGGTG ACAACGTTGG TTTCAACGTC AAGAACGTTT CCGTTAAGGA 950 

AATCAGAAGA GGTAACGTCT GTGGTGACGC TAAGAACGAT CCACCAAAGG 1000 

GTTGCGCTTC TTTCAACGCT ACCGTCATTG TTTTGAACCA TCCAGGTCAA 1050 

ATCTCTGCTG GTTACTCTCC AGTTTTGGAT TGTCACACTG CTCACATTGC 1100 

TTGTAGATTC GACGAATTGT TGGAAAAGAA CGACAGAAGA TCTGGTAAGA .1150 

AGTTGGAAGA CCATCCAAAG TT.CTTGAAGT CCGGTGACGC TGCTTTGGTC 1200 

AAGTTCGTTC CATCTAAGCC AATGTGTGTT GAAGCTTTCA GTGAATACCC 1250 

ACCATTAGGT AGATTCGCTG TCAGAGACAT GAGACAAACT GTCGCTGTCG 1300 

GTGTTATCAA GTCTGTTGAC AAGACTGAAA AGGCCGCTAA GGTTACCAAG 1350 

GCTGCTCAAA AGGCTGCTAA GAAATAA 1377 



2) INFORMATION FOR SEQ ID NO: 666 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1536 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(C) ACCESSION NUMBER: extracted from M12082 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 666 

ATGGTTTTGC CAAGACTATA TACTGCTACA TCCCGTGCTG CTTTTAAAGC 50 

AGCCAAACAA TCCGCTCCGC TTCTATCCAC TTCGTGGAAA AGATGTATGG 100 

CCTCAGCTGC TCAATCTACT CCAATCACCG GTAAAGTTAC CGCTGTCATT 150 

GGTGCCATTG TTGACGTTCA TTTTGAACAA TCAGAGTTGC CCGCTATTTT 200 

375 
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GAACGCTTTA GAAATTAAAA CACCTCAAGG TAAGTTGGTT TTGGAAGTTG 250 

CTCAACATTT GGGTGAAAAC ACTGTCAGAA CCATTGCTAT GGATGGTACC 3 00 

GAAGGTTTGG TCCGTGGTGA AAAGGTTCTT GACACTGGTG GCCCTATCTC 350 

CGTCCCAGTT GGGAGAGAAA CTTTAGGGAG AATCATCAAC GTTATCGGTG 400 

5 AACCTATTGA TGAAAGAGGT CCAATTAAGT CCAAACTAAG AAAGCCAATT 450 

CACGCAGACC CTCCTAGTTT TGCAGAACAA TCTACTTCGG CTGAAATTTT 500 

GGAAACAGGT ATCAAAGTCG TCGATCTATT AGCTCCTTAT GCCAGAGGTG 550 

GTAAGATTGG TCTTTTCGGT GGTGCAGGTG TCGGTAAGAC TGTGTTCATT 600 

CAAGAATTGA TTAACAATAT CGCCAAGGCC CATGGTGGTT TTTCCGTTTT 650 

10 CGCCGGTGTT GGTGAAAGGA CCAGAGAGGG TAATGACTTG TACCGTGAAA 700 

TGAAGGAAAC TGGAGTCATT AACTTGGAAG GTGAATCCAA GGTCGCCTTA 750 

GTTTTCGGTC AAATGAACGA ACCTCCAGGA GCCAGAGCCA GAGTCGCTTT 800 

AACTGGTTTG ACGATCGCTG AATATTTCAG AGATGAAGAA GGTCAAGACG 850 

TCTTGTTGTT TATCGACAAT ATCTTTAGAT TTACTCAAGC TGGTTCAGAA 900 

15 GTCTCTGCCC TTTTGGGTCG TATTCCATCT GCCGTCGGTT ATCAACCAAC 950 

TTTGGCCACT GATATGGGTC TCTTACAAGA AAGAATTACC ACCACAAAGA 1000 

AGGGTTCTGT CACTTCTGTG CAAGCCGTTT ATGTTCCAGC CGATGATTTA 1050 

ACAGATCCGT CTCCGTCCAC ATCTTTTGCC CATTTGGACG CATCATCCGT 1100 

CTTGTCAAGA GGTATTTCAG AATTAGGTAT TTACCCTGCA GTGGATCCAT 1150 

20 TGGATTCTAA ATCAAGGTTA TTGGATGCCG CCGTTGTCGG TCAAGAACAT 12 00 

TATGACGTCG CCTCCAAGGT TCAAGAAACT TTACAGACCT ATAAATCTTT 1250 

ACAAGATATC ATTGCTATTT TGGGTATGGA TGAATTGTCC GAACAAGATA 1300 

AACTAACTGT CGAAAGGGCA AGAAAGATTC AAAGATTCTT ATCTCAACCA 1350 

TTTGCTGTCG CCGAAGTCTT TACTGGTATC CCAGGTAAAT TAGTGAGATT 1400 

25 AAAGGACACC GTTGCCTCGT TCAAAGCCGT TTTGGAAGGT AAATACGATA 1450 

ATATACCAGA ACATGCTTTC TATATGGTTG GTGGTATTGA AGATGTTGTT 1500 

CGTAAAGCTG AAAAGTTAGC CCGTGAAGCC AACTAG 1536 



30 



2 ) INFORMATION FOR SEQ ID NO: 667 



(i) SEQUENCER CHARACTERISTICS : 
(A) LENGTH: 12 93 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: Y 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 667 

CGGAAGGCGT GCCGCCGGTG TTAACTGCTT TGGATGTGGT GGAAAAACTT 50 

GGCCGTGACG AGCCCTTGAC ACTTGAAATT GTACAGCACT TGGACGCCCA 100 

CACCGGCCGT TGCATTGCGA TGCAGACGAC AGATCTGCTT AAGCTGAAAT 150 

50 CGAAGGTAGT CTCGAGTGGC GGGAACATTT CTGTGCCTGT GGGTCGGGAG 200 

ACACTTGGCA GAATCTTCAA TGTGCTGGGC GATGCCATTG ACCAGCGCGG 250 

CCATGTTGGA GAGAAGCAAC GCATGCCTAT TCACGCTGTG GCACCAAAGT 300 

TGGCGGATCA GGCGGCAGAG GACACCATTC TCACAACGGG TATCAAGGTG 350 

ATTGATCTGA TTTTGCCCTA CTGCAAGGGT GGGAAGATTG GTCTTTTTGG 400 

55 TGGTGCTGGC GTGGGCAAGA CGGTTATTAT CATGGAACTT ATTAACAACG 450 

TTGCCAAGGG CCATGGTGGG TTCTCCGTCT TTGCGGGTGT TGGTGAACGC 500 

ACGCGTGAGG GGACGGATCT TTACCTGGAG ATGATGCAAT CCAAAGTTAT 550 

TGACCTGAAG GGTGACTCGA AGTGTGTGTT GGTGTATGGT CAGATGAACG 600 

AGCCTCCGGG TGCCCGTGCT CGTGTGGCGC AGTCTGCCTT GACAATGGCC 650 

60 GAGTACTTCC GTGATGTGGA AGGGCAAGAC GTGTTGCTTT TTATCGACAA 700 

376 
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CATTTTCCGC TTTACGCAGG CAAACTCTGA GGTGTCAGCG CTOTTGGGTC 750 

GTATTCCCGC TGCCGTCGGC TACCAGCCTA CCCTTGCTGA GGATCTTGGG ^800 

CAGTTGCAGG AGCGCATTAC GTCCACGACA AAAGGTTCCA TTACCTCTGT 850 

GCAGGCTGTG TACGTGCCAG CCGATGATAT TACCGACCCT GCGCCAGCAA 900 

5 CGACCTTTTC CCACCTCGAT GCCACGACGG TGCTGGACCG TGCCGTTGCC 950 

GAATCCGGCA TTTACCCCGC TGTCAACCCA CTGGAGTGTG CCTCGCGTAT 1000 

CATGGACCCG GATGTAATCA GCGTTGACCA CTACAACGTG GCGCAGGATG 1050 

TGGTGCAGAT GCTTACCAAG TACAAGGAGC TGCAGGATAT CATTGCGGTG 1100 

CTTGGCATTG ATGAGCTCAG TGAGGAGGAT AAACTTATCG TGGACCGTGC 1150 

10 GCGTAAGGTG ACAAAGTTTC TCTCCCAGCC TTTCCAGGTG GCGGAGGTGT 12 00 

TTACTGGCAT GACAGGCCAC TACGTGCAGC TGGAGGAGAC CATTGAGTCT 12 50 

TTTTCCGGCC TGTTGATGGG CACATATGAT CAGGTGCCGG AGA 12 93 



15 

2) INFORMATION FOR SEQ ID NO: 668 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1191 bases 
2 0 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium glutamicum 
(C) ACCESSION NUMBER: X77034 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 668 

GTGGCAAAGG CGAAGTTCGA GCGTACCAAG CCCCACGTAA ACATCGGCAC 50 

CATCGGTCAC GTTGACCACG GTAAGACCAC CACCACCGCG GCTATCACCA 100 

AGGTTCTGGC TGACACTTAC CCTGAGCTCA ACGAGGCTTT CGCCTTCGAC 150 

35 TCCATCGATA AGGCTCCTGA GGAGAAGGAG CGTGGCATCA CGATCAACAT 2 00 

CTCCCACGTT GAGTACCAGA CTGAAAAGCG CCACTACGCA CACGTTGACG 2 50 

CTCCAGGCCA CGCCGACTAC ATCAAGAACA TGATTACCGG CGCTGCTCAG 3 00 

ATGGACGGCG CAATCCTCGT TGTTGCTGCT ACCGACGGCC CAATGCCTCA 3 50 

GACCCGTGAG CACGTTCTTC TTGCTCGCCA GGTTGGCGTT CCTTACATCC 4 00 

40 TCGTTGCTCT TAACAAGTGC GACATGGTTG AGGATGAGGA AATCATCGAG 4 50 

CTCGTCGAGA TGGAAGTTCG TGAACTTCTT GCTGAGCAGG ACTACGACGA 500 

AGAGGCTCCA ATTGTTCACA TCTCCGCTCT GAAGGCTCTT GAGGGCGACG 550 

AGAAGTGGGG CAAGCAGATC CTTGAGCTCA TGCAGGCTTG CGATGACAAC 600 

ATCCCTGACC CAGTTCGTGA GACCGACAAG CCATTCCTCA TGCCTATCGA 650 

45 GGACATCTTC ACCATCACCG GTCGTGGCAC CGTTGTTACC GGTCGTGTTG 700 

AGCGCGGTAC CCTGAACGTG AACGATGATG TTGACATCAT CGGCATCAAG 750 

GAGAAGTCCA CCTCCACCAC CGTTACCGGT ATCGAGATGT TCCGTAAGCT 800 

TCTTGACTCC GCTGAGGCTG GCGACAACTG TGGTCTGCTT CTCCGTGGTA 850 

TCAAGCGCGA AGATGTTGAG CGTGGCCAGG TTATCGTTAA GCCAGGCGCT 900 

50 TACACCCCTC ACACCGAGTT CGAGGGCTCT GTCTACGTTC TGTCCAAGGA 950 

TGAAGGTGGC CGCCACACCC CATTCTTCGA CAACTACCGT CCTCAGTTCT 1000 

ACTTCCGCAC CACCGACGTT ACCGGTGTTG TGAAGCTTCC AGAGGGCACC 1050 

GAGATGGTCA TGCCTGGCGA CAACGTCGAC ATGTCCGTCA CCCTGATCCA 1100 

GCCTGTCGCT ATGGACGAGG GCCTGCGTTT CGCTATCCGC GAAGGCTCCC 1150 

55 GCACCGTTGG CGCTGGTCGT GTCACCAAGA TCATCAAGTA A 1191 



2) INFORMATION FOR SEQ ID NO: 669 

60 

377 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1383 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Escherichia coli 

(C) ACCESSION NUMBER: extracted from V00267 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 669 

15 ATGGCTACTG GAAAGATTGT CCAGGTAATC GGCGCCGTAG TTGACGTCGA 50 

ATTCCCTCAG GATGCCGTAC CGCGCGTGTA CGATGCTCTT GAGGTGCAAA 100 

ATGGTAATGA GCGTCTGGTG CTGGAAGTTC AGCAGCAGCT CGGCGGCGGT 150 

ATCGTACGTA CCATCGCAAT GGGTTCCTCC GACGGTCTGC GTCGCGGTCT 200 

GGATGTAAAA GACCTCGAAC ACCCGATTGA AGTCCCGGTA GGTAAAGCGA 250 

20 CTCTGGGCCG TATCATGAAC GTACTGGGTG AACCGGTCGA CATGAAAGGC 300 

GAGATCGGTG AAGAAGAGCG TTGGGCGATT CACCGCGCAG CACCTTCCTA 350 

CGAAGAGCTG TCAAACTCTC AGGAACTGCT GGAAACCGGT ATCAAAGTTA 400 

TCGACCTGAT GTGTCCGTTC GCTAAGGGCG GTAAAGTTGG TCTGTTCGGT 450 

GGTGCGGGTG TAGGTAAAAC CGTAAACATG ATGGAGCTCA TTCGTAACAT 500 

25 CGCGATCGAG CACTCCGGTT ACTCTGTGTT TGCGGGCGTA GGTGAACGTA 550 

CTCGTGAGGG TAACGACTTC TACCACGAAA TGACCGACTC CAACGTTATC 600 

GACAAAGTAT CCCTGGTGTA TGGCCAGATG AACGAGCCGC CGGGAAACCG 650 

TCTGCGCGTT GCTCTGACCG GTCTGACCAT GGCTGAGAAA TTCCGTGACG 700 

AAGGTCGTGA CGTTCTGCTG TTCGTTGACA ACATCTATCG TTACACCCTG 750 

30 GCCGGTACGG AAGTATCCGC ACTGCTGGGC CGTATGCCTT CAGCGGTAGG 800 

TTATCAGCCG ACCCTGGCGG AAGAGATGGG CGTTCTGCAG GAACGTATCA 850. 

CCTCCACCAA AACTGGTTCT ATCACCTCCG TACAGGCAGT ; ATACGTACCT 900 

GCGGATGACT . TGACTGACCC GTCTCCGGCA ACCACCTTTG CGCACCTTGA 950 

CGCAACCGTG GTACTGAGCC GTCAGATCGC GTCTCTGGGT ATCTACCCGG 1000 

35 CCGTTGACCC GCTGGACTCC ACCAGCCGTC AGCTGGACCC GCTGGTGGTT 1050 

GGTCAGGAAC ACTACGACAC CGCGCGTGGC GTTCAGTCCA TCCTGCAACG 1100 

TTATCAGGAA CTGAAAGACA TCATCGCCAT CCTGGGTATG GATGAACTGT 1150 

CTGAAGAAGA CAAACTGGTG GTAGCGCGTG CTCGTAAGAT CCAGCGCTTC 1200 

CTGTCCCAGC CGTTCTTCGT GGCAGAAGTA TTCACCGGTT CTCCGGGTAA 1250 

40 ATACGTCTCC CTGAAAGACA CCATCCGTGG CTTTAAAGGC ATCATGGAAG 1300 

GCGAATACGA TCACCTGCCG GAGCAGGCGT TCTACATGGT CGGTTCCATC 1350 

GAAGAAGCTG TGGAAAAAGC CAAAAAACTT TAA 13 83 



45 



55 



60 



2) INFORMATION FOR SEQ ID NO: 670 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1410 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(B) STRAIN: NCTC 11638 

(C) ACCESSION NUMBER: extracted from AF004 014 

378 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 670 



PCT/CAOO/01150 



ATGAAAGCGA TGGAAGGTAA AATCATTCAG GTTTTAGGCC CGGTGGTAGA 50 

TGTGGAGTTT GAATCCTATC TGCCGGCGAT TTTTGAAGCA CTAGACATTA 100 

5 ATTTTGAAGT TAATGGCGTT CAAAAATCTT TAGTTTTAGA GGTGGCAGCC 150 

CATTTGGGTG GTAATCGGGT GCGAGCGATT GCTATGGATA TGACAGAAGG 200 

CTTAGTGCGT AACCAAGCCG TCAAAGCTCG TGGCAAAATG ATTGAAGTGC 250 

CTGTGGGCGA AGAAGTGTTA GGGCGTATTT TTAATGTTGT GGGCGAGAGC 300 

ATTGATAATT TAGAGCCGCT TAAGCCGTCC TTAACTTGGC CCATTCACAG 350 

10 AAAAGCCCCT AGTTTTGAGC AGCAAAGCAC TAAAACAGAA ATGTTTGAAA 400 

CCGGTATTAA AGTCATTGAC TTGCTCGCGC CTTATTCTAA GGGCGGTAAA 450 

GTAGGCTTGT TTGGTGGGGC TGGCGTAGGC AAAACGGTGA TCATTATGGA 500 

GCTTATCCAC AATGTGGCTT ATAAGCATAA CGGGTATTCG GTGTTTGCAG 550 

GTGTGGGGGA GCGCACCAGA GAAGGGAACG ATCTGTATTT TGAGATGAAA 600 

15 GAAGGGGGCG TTTTAGACAA AGTTGCGTTG TGCTATGGGC AAATGAATGA 650 

GCCACCAGGT GCAAGGAATC GCATCGCATT CACCGGCTTG ACGATGGCGG 700 

AGTATTTCCG TGATGAAAAG GGCTTAGATG TGTTGATGTT TATTGATAAC 750 

ATCTTTAGAT ACGCTCAAAG CGGTGCGGAA ATGAGCGCGC TATTAGGCCG 800 

TATCCCTTCA GCGGTGGGGT ATCAGCCCAC GCTAGCCGGG GAAATGGGGA 850 

20 AACTTCAAGA GCGTATCGCT TCCACTAAAA ATGGCTCTAT CACTTCGGTT 900 

CAAGCGGTGT ATGTGCCAGC AGACGACTTG ACTGACCCAG CCCCTGCTTC 950 

GGTGTTTGCG CATTTAGATG CGACTACGGT GTTGAATAGA AAGATCGCTG 1000 

AAAAAGGGAT TTATCCGGCG GTGGATCCTT TGGATTCCAC TTCAAGGATT 1050 

TTAAGCCCTC AAATGATTGG CGAGAAGCAC TATGAAATCG CCACCGGTAT 1100 

25 CCAGCAAGTT TTGCAAAAAT ACAAGGATTT GCAAGATATT ATTGCGATTT 1150 

TGGGATTGGA CGAATTGAGC GAAGAGGATA AAAAAACGGT TGAAAGGGCC 1200 

AGAAAAATTG AGAAGTTTTT ATCCCAGCCG TTTTTTGTGG CTGAAGTGTT 1250 

TACAGGAAGT CCCGGTAAGT ATGTGACTCT CCAAGAGACT TTAGAGGGCT 1300 

TTGGAGGGAT TTTAGAGGGC AAATACGATC ACATTCCTGA AAACGCGTTT 1350 

3 0 TACATGGTGG GCAGCATTCA AGAGGTTTTA GAAAAAGCTA AAAACATGAA . 1400 

AAATTCCTAA 1410 



35 2) INFORMATION FOR SEQ ID NO: 671 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 01 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium acetobutylicum 

(B) STRAIN: DSM 792 

(C) ACCESSION NUMBER: extracted from AF101055 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 671 

ATGCCAGAAC ATGTAGGTAA AATTGTTCAG GTAATAGGAC CTGTTGTGGA 50 

TATTAAATTT GATGCAGAGA ACCTTCCTGA CATCTATAAT TCCATAGAAA 100 

TAGATATGGG AGATAATAAA AAACTCATTG CTGAAGTTGA ACAACATGTA 150 

55 GGAGATGACA TAGTAAGAAC AATAGCAATG GAAGGTACTG ACGGATTAAA 200 

AAGAGGAATG GAAGCAGTTA ACACTGGTAA ACCAATATCT GTACCAGTTG 250 

GAGAAAATGT TTTAGGACGT CTTTTTAATG TTTTAGGTCA GACAATAGAT 300 

GAAGCAGGAG ACATGAATGC TGATAAGTAT TATCCAATTC ATAGACCAGC 350 

TCCAACCTTT GAAGAACAAT CAGTTCAACC AGAAATGTTT GAAACAGGTA 4 00 

60 TTAAGGTTAT AGATTTACTT GCTCCATATC AAAAGGGTGG AAAAATCGGT 450 

379 
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TTGTTCGGTG GTGCCGGTGT TGGTAAAACA GTTCTTATTC AGCS^ACTTAT ioO 

AAATAATATA GCAAAAGAAC ACGGTGGATT ATCAGTATTC ACAGGTGTTG 550 

GAGAAAGAAC AAGAGAAGGT AATGACCTTT ATTATGAAAT GAAAGATTCA 600 

GGAGTTATAA ATAAAACAGC TCTAGTATTT GGTCAGATGA ATGAACCACC 650 

5 TGGCGCTAGA ATGAGAGTTG CTTTAACAGG ACTTACAATG GCTGAATATT 700 

TTAGAGACAA AGGTCAAGAT GTGCTTCTAT TTATAGATAA TATATTCAGA 750 

TTTACACAAG CTGGTTCAGA GGTTTCAGCT TTACTTGGTA GAATACCTAG 800 

TGCCGTTGGT TATCAGCCAA CACTTGCAAA TGAAATGGGT GCTCTTCAAG 850 

AGAGAATAAC ATCAACAAAA CAGGGTTCAA TCACATCCGT TCAGGCTGTA 900 

10 TATGTTCCTG CTGATGACCT TACAGACCCA GCTCCAGCAA CAACATTTAC 950 

GCATCTTGAT GCAACAACAG TTCTTTCAAG AGAAATATCA AACTTAGGAA 1000 

TATATCCTGC AGTTAGTCCT CTTGAATCAA CTTCAAGAAT ACTTGATCCA 1050 

AGAATTGTTG GAGAAGAGCA TTATGAAGTT GCTAACAAGG TTAAACATAT 1100 

ACTTGAAAGA TATCAAGAAC TTCAAGATAT CATAGCTATA CTTGGTGTTG 1150 

15 ATGAACTTTC AGATGAGGAT AGATTGTTAG TTGGAAGAGC AAGAAGAGTA 12 00 

CAGAGATTCT TATCTCAAGC TTTTAGTGTT GCTGAACAAT TTACAGGAAT 1250 

GAAAGGTCAG TTTGTACCTG TAAAAGATAC TATAAGAAGT TTTAAAGAAA 13 00 

TATTAGATGG TAAGTGTGAT GATCTTCCAG AAGCTGCATT TTTATTTGCA 13 50 

GGAACAATAG AAGATGTAAA AGAAAAAGCT AAAAAAATGA TGGAAAGCTA 14 00 

20 A 1401 



2 ) INFORMATION FOR SEQ ID NO: 672 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1509 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
30 "(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Cytophaga lytica 

(B) STRAIN: DSM 2039 

(C) ACCESSION NUMBER: M22535 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 672 



ATGTCTAAAG TTACAGGTAA AGTTTCCCAA ATTATTGGCC CAGTTATAGA 50 

TGTGGAGTTC CAAGCAGGGG TAGATCTTCC AAAAATTTAT GATTCATTAG 100 

AAATTAAAAA AGCAGATGGA TCAATTTTGG TTTTGGAAGT ACAATCACAC 150 

ATTGGTGAGA ACACAGTAAG AACTATATCT ATGGATTCTT CTGATGGTTT 2 00 

45 AAGTCGTGGA GCAGAGGTTA ATGCAACAGG AAGCGCTATA CAAATGCCAG 2 50 

TTGGAGATGA CGTTTACGGA CGTTTATTTA ACGTAATTGG AGACGCTATT 3 00 

GATGGTCTTG GGAATTTACC AAAATCTGGT AAAGATGGTT TGC CAATACA 350 

CAGAGAGGCA CCAAAATTTG AAGACTTATC TACTTCTACA GAAGTATTAT 4 00 

TTACAGGTAT TAAAGTAATT GACCTTATTG AGCCTTATGC AAAAGGTGGT 450 

50 AAGATTGGTT TATTTGGAGG TGCCGGAGTA GGTAAAACAG TATTAATTCA 500 

GGAATTAATT AACAACATTG CAAAAGGTCA CGGTGGACTT TCTGTATTTG 550 

CTGGTGTAGG TGAGCGTACT CGTGAGGGTA ACGATTTACT ACGTGAGATG 600 

TTAGAGTCTG GTATTATTAA ATACGGAGAT GACTTTATGC ACTCTATGGA 650 

AGAAGGTGGT TGGGATTTAT CTAAAGTTGA TAAATCTGTA ATGAAAGATT 700 

55 CTAAAGCAAC CTTTGTATTT GGACAAATGA ATGAGCCACC AGGAGCACGT 750 

GCACGTGTTG CATTATCTGG TTTAACTATT GCAGAATATT TCCGTGATGG 800 

AGCAGGTGAA GGTCAAGGTA AAGATGTACT TTTCTTTGTG GATAACATTT 850 

TCCGTTTTAC ACAAGCTGGT TCTGAGGTAT CTGCATTACT TGGTCGTATG 900 

CCATCTGCGG TAGGTTACCA ACCAACATTA GCAACAGAAA TGGGTGCTAT 950 

60 GCAAGAGCGT ATTACATCAA CAAAAAGAGG TTCTATTACA TCTGTACAGG 1000 

380 
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CGGTTTACGT ACCTGCGGAT GATTTAACGG ATCCAGCACC GGCA^CTACC 1#50 

TTTGCTCACT TAGATGCAAC AACGGTATTG TCTCGTAAAA TTGCAGAGTT 1100 

AGGTATTTAC CCAGCGGTAG ATCCATTAGA TTCTACTTCT AGAATCTTAG 1150 

CTCCAGAAAT TTTAGGAAAA GATCACTACT CTTGTGCACA ACGTGTAAAA 1200 

5 GAGTTGTTAC AACGTTATAA AGAATTACAA GATATTATTG CTATCCTTGG 1250 

TATGGAAGAA TTATCTGAGG AAGATAAAAT GGCAGTTGGT AGAGCAAGAC 1300 

GTGTACAACG TTTCTTATCT CAGCCTTTCC ACGTAGCAGA GCAATTTACA 1350 

GGTCTTAAAG GTGTTTTAGT AGATATCAAG GATACTATTA AAGGATTTAA 1400 

TATGATTATG GATGGTGAAT TAGATCACTT ACCAGAATCT GCATTTAACC 1450 

10 TTAAAGGTAC TATTGAAGAA GCTATAGAAG CAGGAGAAAA AATGCTTGCT 1500 

GAAGCATAA 1509 



15 2) INFORMATION FOR SEQ ID NO: 673 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ehrlichia risticii 

(B) STRAIN: HRC-IL 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 673 



CCTAAAATAT ATGATGCATT ATATGTAAAA CTAGATAATG AAAATTTGTG 50 

TTTAGAAGTT TCACAAATTA TTGGAGATAA TGTTGTTAGA TGTATTGCAA 100. 

TGGGAGCTAC TTATGGATTA AATCGTGGTT TAGAAGTAGT ,TTGTTCAGGA 150 

AATCCAATTC AGGTTCCTGT AGGTGAACAA GTTTTAGGTA GAATGTTTAA 200 

35 TGTTGTTGGT AAAACAATTG ACAATCTTGA ATCTTTAGAT GATAAAAATA 250 

TAAAAATGAT GCCAATTCAT CGAAATCCAC CATCATTTGA AGAGCAATCC 300 

AATGAAATTG AAATTTTTGA AACAGGCATT AAAGTTATTG ATTTATTAAT 350 

TCCATATGCT AAAGGTGGTA AGATTGGATT ATTTGGTGGA GCAGGGGTTG 400 

GGAAAACGGT TCTTGTTCAA GAATTAATTC ACAATATCGC AAAAGGTCAT 450 

40 GGTGGTCTAT CTGTTTTTGC TGGAGTTGGT GAAAGAACTC GTGAAGGTAA 500 

TGACTTGTAT TATGAAATGA TTGAAGGTGG AGTTATAGAT AAAACAGCCT 550 

TAGTGTTTGG GCAAATGAAT GAACCTCCTG GCGCAAGAAT GCGCGTAGCA 600 

TTAACTGCTT TAACAATGGC TGAATATTTC CGTGATGTTC AAAACCAAGA 650 

TGTTTTGTTA TTTATTGATA ATATCTTTAG ATTTACACAA GCTGGTAGTG 700 

45 AAGTTTCAGC ATTATTAGGA AGAATGCCAT CTGCTGTTGG TTATCAACCA 750 

ACTTTGGCAT ATGAAATGGG ATTGTTACAA GAAAGAATCA CTTCCACTAA 800 

AAGTGGTTCT ATAACATCT 819 



50 



60 



2) INFORMATION FOR SEQ ID NO: 674 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 840 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

381 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Vibrio cholerae 

(B) STRAIN: ATCC 25870 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 674 

AGAGCGAAGT ACCAAGTGTT TACGATGCTC TGAATGTTGT GGATTCCAAA 50 

GAACGTCTGG TTCTGGAAGT TCAACAGCAG CTAGGCGGTG GCGTGATTCG 100 

CGCTATCGTT ATGGGTAGCT CGGATGGTTT ACGTCGTGGA ATGACAGTAC 150 

10 AAAACACTGG CGCTCCAATT TCAGTACCAG TAGGTACTAA AACCCTAGGT 200 

CGTATCATGA ACGTGCTTGG TGATGCGATT GACGAACGTG GCGACATTGG 250 

CGCAGAAGAG GTGTACTCGA TTCACCGTCC TGCTCCAAGC TACGAAGAAC 300 

AGTCTAGTGC AACTGAACTT TTGGAAACGG GTGTTAAGGT TATCGACCTG 350 

ATCTGTCCGT TTGCGAAAGG CGGTAAAATC GGTCTGTTCG GTGGTGCGGG 400 

15 TGTAGGTAAG ACCGTTAACA TGATGGAACT TATCAACAAC ATCGCGCTAC 450 

AGCACTCAGG TTTGTCAGTA TTTGCTGGGG TAGGTGAGCG TACTCGTGAG 500 

GGTAACGACT TCTACCACGA AATGCAGGAA GCGGGCGTTG TAAACGTTGA 550 

ACAACCAGAA CTGTCGAAAG TAGCGATGGT TTACGGTCAG ATGAACGAGC 600 

CACCAGGCAA CCGTCTGCGT GTAGCACTGA CTGGTCTGAC TATGGCGGAA 650 

20 AAGTTCCGTG ATGAAGGCCG TGACGTACTG CTGTTTATCG ACAACATCTA 700 

CCGTTACACC CTAGCGGGAA CGGAAGTATC TGCTCTGCTT GGCCGTATGC 750 

CTTCAGCGGT AGGTTACCAA CCAACACTGG CTGAAGAGAT GGGTGTTCTG 800 

CAAGAACGTA TCACGTCAAC CAAAAAAGGT TCTATCACCT 840 



25 



40 



60. 



2) INFORMATION FOR SEQ ID NO: 675 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Vibrio cholerae 

(B) STRAIN: ATCC 25870 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 675 



CGGCGGTATC CTAGTTGTAG CGGCAACTGA CGGTCCAATG CCACAAACTC 50 

GTGAGCACAT CCTGCTGGGT CGCCAAGTAG GTATTCCTTA CATCATCGTA 100 

45 TTCATGAACA AGTGTGACAT GGTTGACGAT GAAGAGCTTC TAGAGCTGGT 150 

AGAGATGGAA GTTCGTGAGC TGCTGTCTGA GTACGATTTC CCAGGTGATG 2 00 

ACCTGCCAGT AATCCAAGGT TCAGCACTAG GCGCGCTAAA CGGCGAAGCA 250 

CAGTGGGAAG CGAAGATTGT TGAGCTAGCA GAAGCACTGG ATACTTAYAT 3 00 

TCCAGAGCCA GAGCGTGCAG TAGACATGGC ATTCCTGATG CCAATCGAAG 3 50 

50 ACGTATTCTC AATCCAAGGT CGTGGTACAG TAGTAACTGG CCGTATCGAG 4 00 

CGCGGCATCC TGAAAGTGGG TGACGAAGTA GCGATCGTTG GTATCAAAGA 450 

GACAGTAAAA ACGACCTGTA CAGGTGTAGA GATGTTCCGT AAGCTGCTTG 500 

ACGAAGGTCG TGCAGGTGAG AACGTAGGTG CACTRCTACG TGGTACTAAG 550 

CGTGAAGAAG TAGAGCGTGG TCAAGTACTG GCGAAGCCAG GTTCAATCAC 600 

55 ACCACACACT AAGTTCGAAT CAGAAGTATA CGTACTGTCA AAAGATGAAG 650 

GTGGCCGTCA TACTCCATTC TTCAAAGGTT ACCGTCCACA GTTCTACTTC 7 00 

CGTACAACTG ACGTAACAGG CAGCATTGAG CTACCAGARG GCGTAGAAAT 750 

GGTAATGCCA GGCGACAACG TGAAGATGGT TGTAGACCTG ATTGCACCAA 800 

TCGCGATGGA CGAAGGTCTA CGCTTCGC 828 



382 
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2) INFORMATION FOR SEQ ID NO: 676 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 98 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania enriettii 
15 (B) STRAIN: ATCC 50120 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 676 

CTTCTCGGAT GGCGTGCCGC CGGTGCTGAC GGCGCTGGAC GTGACGGAGG 50 

2 0 AACTCGGGCG CGACGAGCCG CTGACGCTAG AGATCGTGCA ACACTTGGAC 100 

GCGCACACCG GCCGCTGCAT TGCGATGCAG ACGACGGACC TACTGAAGCT 150 

GAAGTCGAAG GTCGTGTCGA CCGGCGGGAA CATCTCTGTG CCTGTGGGCC 200 

GCGAGACGCT TGGCCGCATC TTCAACGTAC TGGGCGACGC GATTGACCAG 250 

CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CGGAGGCGCC 300 

25 GAAGCTGGCG GACCAGGCGG CGGAGGACAC GATCCTGACG ACCGGCATCA 350 

AAGTGATCGA TCTGATCCTG CCGTACTGCA AGGGCGGCAA GATCGGTCTG 4 00 

TTCGGCGGTG CCGGTGTGGG GAAGACTGTG ATCATCATGG AGCTGATCAA 450 

CAACGTCGCG AAGGGCCACG GTGGCTTCTC CGTGTTCGCC GGCGTTGGGG 500 

AGCGCACCCG CGAGGGGACG GATCTGTACC TGGAGATGAT GCAATCGAAG 550 

3 0 GTGATCGACC TGAAAGGTGA GTCGAAGTGT GTGCTTGTGT ACGGGCAGAT 600 

GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

TGGGCGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 700 

GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CAGCGCTGCT 750 

GGGCCGCATC CCCGCCGCTG TGGGCTACCA ACCGACGCTT GCGGAGGATC 800 

35 TCGGCATGTT GCAGGAGCGC ATCACGT CGA CGACGAAGGG GTCGATCACG 850 

TCTGTGCAGG CGGTGTACGT GCCTGCGGAT GATATCACGG ATCCCGCGCC 900 

GGCGACGACA TTCTCGCACC TGGACGCGAC GACGGTGCTG GACCGCGCGG 950 

TGGCGGAGTC TGGCATCTAC CCCGCGGTCA ACCCACTGGA GTGCGCGTCG 1000 

CGTATCATGG ACCCTGATGT GATCGACGTG GATCACTACA ACGTTGCGCA 1050 

40 GGACATTGTT CAGATGC TGA CCAAGTACAA GGAGCTGCAG GATATCATTG 1100 

CGGTGCTTGG TATCGACGAG CTGAGCGAGG AGGACAAGGT TGTGGTGGAC 1150 

CGTGCGCGCA AGGTGACCCG GTTCCTGTCA CAGCCGTTCC AGGTTGCGGA 12 00 

GGTGTTTACT GGCATGACAG GCCATTACGT ACAGCTGAGC GACACGGTGG 1250 

AGTCGTTCTC CGGTTTGCTG ATGGGGTCGT ACGACCAGAT TCCGGAGA 1298 

45 



2) INFORMATION FOR SEQ ID NO: 677 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1083 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia microtti 
60 (B) STRAIN: Persing-1 

383 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 677 

CAAGCTCAAG TCTGAGCGTG AGAGAGGTAT TACTATTGAC ATTACTCTCT 50 

5 GGAAATTTGA GACCCAGAAA TACGAGTACA CTGTCATAGA CGCACCTGGT 100 

CATCGTGACT TTATCAAAAA TATGATTACT GGGACTTCAC AAGCCGACGT 150 

TGCTATGCTT GTCGTTCCTG CTGAATCTGG CGGATTCGAG GCTGCTTTTT 200 

CCAAAGAAGG TCAGACCCGT GAACACGCCT TACTAGCCTT CACACTTGGC 250 

GTCAAACAGA TGATTGTTGC TATTAACAAA ATGGATTCTT GTCAGTACAA 300 

10 GGAGGATCGT TATATGGAAA TTTTCAAGGA AGTACAGCAG TACTTGAAGA 350 

AGGTGGGTTA CAAAGTTGAA AGCGTGCCGT TTGTTGCTAT TTCAGGATTC 400 

CACGGTGACA ACATGGTTGA AAAATCTACT AACATGCCTT GGTATAAGGG 450 

TAAGACCCTC GTAGAGGCAC TTGATCAAAT GGAGCCTCCA AAACGTCCGG 500 

TCGAAAAACC TCTTAGATTG CCCCTGCAGT CAGTCTATAA AATTGGAGGT 550 

15 ATTGGTACGG TACCAGTCGG AAGGGTCGAA ACAGGACAAC TGAAAGCAGG 600 

AATGATCATT ACTTTTGCCC CCACTGGTTT GACCACTGAA TGTAAATCTG 650 

TTGAAATGCA TCACGAGGTT GTGGAAGTGG CTAGCCCCGG TGATAACGTT 700 

GGATTTAATG TCAAGAATGT GTCTGTTAAG GATATTAAGA GAGGAAATGT 750 

GGCTTCGGAT TCGAAAAATG ACCCAGCCAA GGAAGCTACC TCTTTCTCTG 800 

20 CACAAGTCAT TGTACTCAAT CACCCTGGTA CCATCAAGGC CGGTTACTCA 850 

CCTGTGGTTG ATTGCCATAC TGCCCACATT GCTTGCAAAT TCGAATCTCT 900 

AGACACTAGG ATTGACAAGC GTACTGGCAA GACTTTGGAA GAAAATCCTA 950 

AGACTATTAA GAATGGTGAC GCTGCCATGG TGACTATGAA ACCAAATAAA 1000 

CCCATGGTTG TGGAAACTTT CACCGACTAC GCCCCGTTGG GCCGGTTCGC 1050 

25 CGTGCGTGAT ATGCGCCAAA CCGTTGCCGT CGG 1083 



30 



2) INFORMATION FOR SEQ ID NO: 678 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 551 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Cryptococcus neoformans 

(B) STRAIN: Lev- 12 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 678 

45 TGTGCTCATT CAGGAGCTCA TTAACAACAT TGCCAAGGCT CACGGTGGTT 50 

ACTCTGTCTT CACTGGTGTC GGTGAGCGAA CTCGTGAGGG TAACGACTTG 100 

TACCACGAAA TGAGGGAGAC TGGTGTTATC AACCTTGAGG GTGACTCCAA 150 

GGTCGCTCTT GGTGAGTTCT TTTTTTTCTT CAGGCTAATT AGTCGATGAC 2 00 

GTGGGCCCTG ACTAAAACTG TTTCTTCCAG TCTTCGGTCA GATGAACGAG 250 

50 CCCCCTGGAG CCCGTGCCCG AGTTGCCCTT ACTGGTTTGA CCATTGCCGA 3 00 

GTACTTCCGT GACGAGGAAG GCCAGGATGT GTTGCTTTTC ATTGACAACA 3 50 

TTTTCCGATT CACCCAGGCC GGTTCCGAGG ■ TGTCTGCCTT GCTCGGTCGT 4 00 

ATCCCCTCTG CCGTCGGTTA CCAGCCCACT CTTTCCACCG ACATGGGTGG 4 50 

TATGCAGGAG CGAATGTAGG TTGCATTCTC TGTGATTTTA CGGCAAGCCT 500 

55 TGACTTTTTT TTTCTAGTAC CACCACCAAG AAGGGTTCCA TTACCTCCGT 550 

C 551 



60 2) INFORMATION FOR SEQ ID NO: 679 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 552 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus neoformans 

(B) STRAIN: ATCC 44104 



15 



30 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 679 



TGTGCTCATT CAGGAGCTCA TTAACAACAT TGCCAAGGCC CACGGTGGTT 50 

ACTCCGTCTT CACCGGTGTC GGTGAGCGAA CTCGTGAGGG TAACGACTTG 100 

TACCACGAAA TGAGGGAGAC TGGTGTTATC AACCTTGAGG GTGACTCCAA 150 

GGTCGCTCTT GGTGAGTTCT TTTTTTTGGG AGCTAATTAG TCGATGACGT 200 

20 GGGCCCTGAC CAAAACCGTT TCTTTCAGTC TTCGGTCAGA TGAACGAGCC 250 

CCCTGGAGCC CGTGCCCGAG TTGCTCTTAC TGGTTTGACC ATTGCCGAGT 300 

ACTTCCGTGA CGAGGAAGGC CAGGATGTGT TGCTTTTCAT TGACAACATT 350 

TTCCGATTCA CCCAGGCCGG TTCCGAGGTG TCTGCCTTGC TCGGTCGTAT 4 00 

CCCCTCTGCC GTCGGTTACC AGCCCACTCT TTCCACCGAT ATGGGTGGTA 450 

25 TGCAAGAGCG AATGTAAGTT GCATTTTTTG TGATTTTACG GCAAGTCTTG 500 

ACTTACATTT TTTTCTAGCA CCACCACCAA GAAGGGTTCC ATTACCTCCG 550 

TC 552 



2 ) INFORMATION FOR SEQ ID NO: 680 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1018 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cunninghamella bertholletiae 

(B) STRAIN: ATCC 42115 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 680 

TCTCCCTGCT ATTTTAAACG CTCTTGAAGT TAAGGATCAT GCTGGTGGTC 50 

GTCTTGTTCT TGAAGTTGCT CAACACTTGG GTGAAAACAC TGTACGTACT 100 

ATTGCTATGG ATGGTACTGA AGGTAAGTTT ATTTTTAGAT GATCATAAAT 150 

50 AATTGATCAT AATGATAAAA AAAAAAGAAG AAGAAGAACA GGATGTATAT 200 

AATGGTTAAT AAATAATATT TTCATATTGK ATATAACTAT TTAATCTGTT 250 

TTTTTTTCTT CATGATTATA TATATATATG TMCTAATATC TAATATGAAC 300 

CTTTTTTATA AAATTAATCA GGTCTTGTCC GTGGTCAAAA GGTTGTTGAT 350 

ACTGGTGCTC CTATTACCAT TCCTGTTGGT AAGGAAGTTC TTGGTCGTAT 4 00 

55 CATCAACGTT ATTGGTGAAC CCATTGATGA ACGTGGTCCT ATTAAGGCCA 450 

AGTCTCACCG TGCTATTCAC GCTGAAGCTC CAGAATTCGT TGATCAATCT 500 

CCTACTCCCG AAATTCTTGA AACTGGTATT AAGGTTGTCG ATTTATTAGC 550 

TCCTTATGCT CGTGGTGGTA AGATTGGTCT TTTCGGTGGT GCTGGTGTAG 600 

GTAAAACTGT CTTGATTCAA GAACTTATTA ACAACATTGC TAAAGCCCAT 650 

60 GGTGGTTACT CTATTTTCTG TGGTGTTGGT GAACGTACTC GTGAAGGTAA 700 

385 
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CGATTTATAC CACGAAATGA TGGAAACTGG TGTCATTAAA CTTGkAGGTG 750 

ACTCCAAGTG TGCTCTTGTA TTCGGTCAAA TGAACGAACC TCCTGGTGCT 800 

CGTGCCCGTG TTGCTTTAAC TGGTTTAACC ATTGCTGAAT ACTTCCGTGA 850 

TGAAGAAGGT CAAGATGTGT TACTTTTCAT TGATAACATT TTCCGTTTCA 900 

CTCAAGCTGG TTCTGAAGTA TCTGCCCTTT TAGGTCGTAT TCCATCTGCT 950 

GTAGGTTACC AACCCACTTT ATCTACTGAT ATGGGTGGTA TGCAAGAACG 1000 

TATTACTACT ACCAAGAA 1018 



2 ) INFORMATION FOR SEQ ID NO: 681 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 681 
GGISSITTYG GIISIGGIAA RAC 23 



2 ) INFORMATION FOR SEQ ID NO: 682 



(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 682 
GTIACIGGYT CYTCRAARTT ICC ICC 26 

40 

2) INFORMATION FOR SEQ ID NO: 683 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 683 
55 GTIACIGGIT CISWIAWRTC ICCICC 26 



2) INFORMATION FOR SEQ ID NO: 684 

386 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3267 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Candida tropicalis 

(C) ACCESSION NUMBER: M64984 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 684 

15 ATGGCTGGTG CTTTAGAAAA CGCAAGAAAA GAAATTAAAC GTCTTTCATT 50 

AGATGACACC AATGAATCCC AATATGGTCA AATCTATTCT GTTTCCGGTC 100 

CGGTTGTTAT TGCCGAAAAC ATGATTGGAT GTGCCATGTA CGAATTGGTT 150 

AAAGTTGGTC ATGATAATTT AGTTGGGGAA GTTATTAGAA TTAATGGTGA 200 

TAAAGCAACC ATTCAAGTTT ATGAAGAAAC TGCAGGGGTC ACTGTTGGTG 250 

20 ATCCAGTTTT AAGAACTGGT AAAC CATTAT CTGTTGAATT AGGTCCTGGT 300 

TTAATGGAAA CTATTTATGA TGGTATTCAA AGACCTTTAA AAGCCATTAA 350 

' AGATGAATCC CAATCTATTT ATATCCCAAG AGGTATTGAT GTTCCTGCTT 400 

TATCAAGAAC TGTT CAATAT GATTTCACTC CAGGTCAATT GAAAGTTGGT 450 

GATCATATCA CTGGTGGGGA CATTTTTGGT TCTATTTATG AAAACTCTTT 500 

25 ATTGGATGAC CATAAGATTT TGTTACCTCC AAGAGCAAGA GGTACTATTA 550 

CTTCTATTGC TGAAGCCGGT TCTTATAATG TTGAAGAACC AGTTTTGGAA 600 

GTTGAATTTG ATGGTAAGAA ACATAAATAC TCTATGATGC ATACATGGCC 650 

AGTTAGAGTT CCAAGACCAG TTGCTGAAAA ATTGACTGCT GATCATCCAT 700 

TGTTGACCGG TCAAAGAGTC TTGGATTCTT TATTCCCATG TGTTCAAGGT 750 

30 GGTACTACTT GTATCCCAGG GGCTTTTGGT TGTGGTAAAA CTGTTATTTC 800 

TCAATCTTTG TCCAAATTCT CCAACTCTGA TGTTATTATC TATGTTGGTT 850 

GTTTCACTAA AGGTACTCAA GTCATGATGG CTGATGGTGC CGACAAATCT 900 

ATTGAATCTA TTGAAGTTGG TGACAAAGTC ATGGGTAAAG ATGGTATGCC 950 

AAGAGAAGTT GTTGGCTTAC CAAGAGGTTA TGATGATATG TACAAGGTTC 1000 

35 GTCAACTTTC TAGTACTAGA CGTAATGCTA AATCCGAAGG CTTGATGGAT 1050 

TTCACTGTTT CTGCTGATCA TAAACTTATC TTGAAAACTA AACAAGATGT 1100 

CAAGATTGCT ACACGTAAAA TTGGTGGCAA CACCTATACT GGTGTTACTT 1150 
TCTATGTTTT GGAAAAGACT AAGACTGGTA TTGAATTAGT TAAAGCCAAG . 12 00 

ACTAAAGTTT TCGGTCATCA TATCCATGGT CAAAATGGCG CTGAAGAAAA 1250 

40 AGCTGCTACT TTTGCTGCTG GCATTGACTC TAAAGAATAC ATTGATTGGA 1300 

TCATTGAAGC TAGAGATTAT GTACAAGTTG ATGAAATTGT CAAGACCAGC 1350 

ACCACTCAAA TGATCAACCC AGTTCATTTT GAATCTGGTA AACTCGGTAA 1400 

CTGGTTACAC GAACACAAGC AAAACAAATC ACTTGCTCCA CAATTGGGTT 1450 

ACTTGTTGGG TACTTGGGCT GGTATTGGAA ATGTTAAATC TTCTGCTTTC 1500 

45 ACCATGAACT CCAAAGATGA TGTTAAATTA GCTACAAGAA TTATGAACTA 1550 

CTCTTCAAAA TTGGGCATGA CTTGTTCTTC TACTGAATCC GGTGAACTCA 1600 

ATGTCGCTGA AAACGAAGAA GAATTTTTCA ATAACCTTGG TGCTGAAAAG 1650 

GATGAAGCTG GTGATTTCAC TTTTGATGAA TTTACCGATG CTATGGATGA 1700 

ATTGACTATC AATGTTCATG GTGCAGCTGC AAGCAAGAAG AACAATTTGT 1750 

50 TGTGGAATGC TTTGAAATCT CTTGGTTTCA GAGCCAAGTC TACTGATATT 1800 

GTCAAGAGTA TTCCTCAACA TATTGCTGTT GATGATATTG TTGTCAGAGA 1850 

ATCTTTGATT GCCGGTTTAG TTGATGCTGC TGGTAATGTT GAAACCAAAT 1900 

CCAATGGTTC TATTGAAGCT GTTGTTAGAA CTTCTTTCAG ACATGTCGCT 1950 

AGAGGTCTTG TCAAGATTGC TCATTCTTTG GGTATTGAAT CATC TATTAA 2000 

55 TATTAAAGAT ACTCACATTG ATGCTGCTGG TGTTAGACAA GAATTTGCTT 2050 

GTATTGTCAA TTTGACTGGT GCTCCACTTG CTGGTGTTCT TTCTAAATGT 2100 

GCACTTGCAA GAAACCAAAC TCCAGTTGTC AAATTTACCA GAGACCCAGT 2150 

TTTGTTCAAC TTTGATTTGA TCAAATCTGC AAAAGAAAAC TATTATGGTA 22 00 

TTACTTTGGC TGAAGAAACT GATCATCAAT TCCTTTTATC CAACATGGCC 2250 

60 TTGGTGCACA ACTGTGGTGA ACGTGGTAAT GAGATGGCTG AAGTTTTGAT 2300 

387 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

GGAATTCCCA GAATTGTTTA CTGAAATTTC TGGTAGAAAA GAACCAATTA 2350 

TGAAACGTAC CACTTTGGTT GCCAATACTT CTAATATGCC AGTCGCTGCC 2400 

AGAGAAGCTT CTATTTATAC TGGTATTACA TTGGCTGAAT ATTTCAGAGA 2450 

TCAAGGTAAG AATGTTTCTA TGATTGCTGA TTCTTCTTCA CGTTGGGCTG 2500 

AAGCTTTGAG AGAAATTTCT GGTAGATTGG GTGAAATGCC TGCTGATCAA 2550 

GGTTTCCCAG CTTATTTGGG TGCTAAATTG GCTTCTTTCT ATGAGCGTGC 2600 

CGGTAAAGCC ACTGCTTTGG GTTCACCAGA TAGAGTTGGT TCAGTTTCTA 2650 

TTGTTGCTGC TGTTTCTCCA GCTGGTGGTG ATTTCTCTGA TCCAGTTACT 2700 

ACTTCTACTT TGGGTATTAC TCAAGTTTTC TGGGGGTTGG ATAAGAAATT 2750 

GGCCCAAAGA AAACATTTCC CATCTATTAA CACCAGTGTT TCTTATTCTA 2 80 0 

AATACACCAA TGTTTTGAAC AAATACTATG ATTCCAACTA TCCAGAATTC 2850 

C CACAATTGA GAGACAAAAT TAGAGAAATT TTATCTAATG CTGAAGAATT 2900 

GGAACAAGTT GTTCAATTAG TTGGTAAATC TGCATTGTCT GATTCTGATA 2 950 

AGATTACTTT AGATGTTGCT ACCTTGATTA AAGAAGATTT CTTGCAACAA 3 000 

AATGGTTATT CTTCATATGA TGCATTCTGT CCAATTTGGA AGACTTTTGA 3 050 

TATGATGAGA GCATTTATTT CATATTATGA TGAAGCACAA AAAGCAATTG 3100 

CCAATGGTGC TCAATGGTCT AAATTAGCTG AAAGTACTAG TGATGTTAAA 3150 

CATGCTGTTT CTTCAGCTAA ATTCTTTGAA CCATCAAGAG GTCAAAAAGA 32 00 

AGGTGAAAAA GAATTTGGAG ATTTATTAAC CACTATCTCC GAAAGATTTG 3250 

CTGAAGCTTC AGAATAA 3267 



2) INFORMATION FOR SEQ ID NO: 685 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1782 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY.: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus hirae 

(B) STRAIN: ATCC 9790 

(C) ACCESSION NUMBER: D17462 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 685 

TTGCAAATTG GAAAAATCAT AAAAGTCTCC GGTCCTCTCG TTATGGCAGA 50 

AAATATGTCA GAAGCAAGTA TTCAAGACAT GTGTTTAGTG GGAGATTTAG 100 

GAGTCATCGG CGAAATCATT GAGATGCGTC AAGATGTGGC GTCTATTCAA 150 

GTATATGAAG AAACTTCAGG AATTGGTCCC GGAGAACCTG TTCGTTCCAC 200 

TGGGGAAGCA CTATCTGTTG AGCTAGGACC AGGAATCATT TCACAAATGT 250 

TTGACGGGAT TCAAAGACCA CTGGATACAT TTATGGAAGT GACTCAAAGT 3 00 

AACTTCTTAG GACGTGGGGT CCAATTACCA GCTTTAGATC ATGAGAAACA 350 

ATGGTGGTTT GAAGCGACAA TCGAAGAAGG AACAGAAGTA AGTGCTGGAG 400 

ACATCATTGG GTACGTGGAT GAAACGAAGA TCATTCAGCA CAAAATCATG 450 

GTCCCTAATG GTATCAAAGG AACTGTACAA AAAATTGAAT CTGGATCATT 500 

TACGATCGAT GATCCGATTT GTGTGATCGA AACGGAACAA GGCTTAAAAG 550 

AGCTGACGAT GATGCAAAAA TGGCCAGTAC GTCGTGGTCG ACCAATCAAA 600 

CAAAAATTAA ATCCAGATGT ACCGATGATC ACCGGTCAAA GGGTCATTGA 650 

CACGTTTTTC CCAGTAACTA AAGGAGGAGC GGCAGCCGTT CCAGGTCCGT 700 

TTGGTGCAGG GAAGACAGTT GTGCAACACC AGATTGCTAA GTGGTCGGAC 750 

GTAGATCTAG TGGTTTACGT TGGTTGTGGG GAACGAGGAA ATGAAATGAC 800 

GGATGTCGTC AATGAATTTC CTGAACTGAT CGATCCAAAT ACAGGCGAGT 850 

CTTTGATGGA ACGAACTGTG TTGATCGCTA ATACATCGAA CATGCCAGTA 900 

GCTGCTCGAG AAGCTTCTAT TTATACGGGA ATCACGATTG CCGAGTACTT 950 

CCGTGACATG GGGTATGATG TAGCAATCAT GGCAGATTCC ACTTCTCGTT 1000 

388 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

GGGCAGAAGC ACTGCGTGAA ATGAGCGGAC 
GATGAAGGTT ATCCCGCTTA TCTGGGCTCT 
ACGTTCAGGA CGTGTCATTG CTCTAGGCTC 
TCACTGCCAT CAGTGCGGTT TCTCCTTCTG 
5 GTGACTCAAA ATACCTTACG TGTGGTGAAG 
TAGTCTTGCT CAAAAAAGAC ATTTTCCATC 
ACTCATTATA TTCAACAGAA GTTGGCAGAT 
CAGGATTGGT CTGATATGGT AACTGAAGGC 
AGAACAATTA AATGAAATCG TGCGCTTGGT 

10 ATAACGATCG CTTGACCCTT GAAGTAGCAA 
TTACAACAAA ATGCTTTTGA TGATGTAGAT 
ACAATTCAAC ATGTTGAAAG TTATTTTGAC 
AAGCCTTATC GTTGGGAGCG TACTTCAATG 
GCGGTCAGAG AACGCATTAG TCGGAGCAAG 

15 AGCCAAAATT AGTAGTATAA ATGAAGAAAT 
TTGTTTCAGA AGGAGGGATG ACCGATGATT 



PCT/CA00/01150 

GTTTAGAAGA AATGCGTGGT 105CT 

CGTCTAGCTG AATACTATGA 1100 

TGACCAACGT GAGGGCAGTA 1150 

GTGGAGATAT CTCTGAACCA 1200 

GTTTTCTGGG GATTAGATTC 1250 

GATTAACTGG ATCCAAAGTT 1300 

ATATGGATCA AATCTTACAA 1350 

ATGCGGATCT TGCAAGAAGA 14 00 

AGGGATCGAT TCGCTTTCTG 1450 

AATCGATTCG AGAAGACTAT 1500 

ACGTTTACTT CAAGAGAAAA 1550 

TTTTGGGAAA GAAGCTCGAA 1600 

AAATCATGGA AGGTACAGTA 1650 

TATATTCCAG AAGAAGAGTT 1700 

CAAAGAAACG ATCCAATTGA 1750 

AA 1782 



20 2) INFORMATION FOR SEQ ID NO: 686 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1781 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia pneumoniae 

(B) STRAIN: CWL 02 9 . 

(C) ACCESSION NUMBER: Genome project '■ 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 686 

CAGGGACATG TTATAGAAGC TTATGGAAAC TTGTTACGTG TACGCTTTGA 50 

CGGATATGTT AGACAAGGTG AAGTTGCATA TGTCAACGTA GATAATACCT 100 

GGTTAAAAGC AGAAGTGATT GAAGTTGCTG ATCAAGAAGT CAAGGTTCAG 150 

40 GTATTTGAAG ATACACAAGG CGCGTGTCGA GGAGCTCTTG TTACGTTTTC 200 

AGGACATCTT TTAGAAGCCG AGTTAGGGCC TGGCTTGCTT CAGGGCATTT 250 

TCGATGGACT TCAAAATCGT CTTGAGGTGC TAGCTGAAGA TAGTTCTTTC 3 00 

TTGCAGAGAG GCAAGCATGT TAATGCTATT TCTGATCATA ATT TATGGAA 350 

TTATACTCCC GTAGCTTCTG TTGGGGATAC TTTAAGACGA GGAGATCTTC 400 

45 TAGGAACAGT ACCTGAAGGA CGATTTACTC ATAAGATTAT GGTTCCTTTT 450 

TCTTGCTTTC AAGAGGTTAC CCTGACTTGG GTAATTTCTG AAGGAACCTA 500 

TAATGCTCAT ACTGTGGTCG CAAAAGCTCG AGATGCTCAG GGTAAAGAAT 550 

GTGCCTTTAC TATGGTGCAA AGATGGCCGA TCAAACAAGC TTTTATTGAA 600 

GGAGAGAAGA TCCCTGCGCA TAAGATTATG GATGTGGGTT TGCGAATCTT 650 

50 AGATACGCAA ATTCCAGTAT TGAAGGGGGG AACTTTCTGT ACCCCAGGAC 700 

CTTTTGGTGC AGGGAAAACA GTCTTACAAC ACCATCTTTC TAAGTACGCT 750 

GCTGTAGATA TTGTGATTTT GTGTGCGTGC GGAGAGCGTG CTGGTGAAGT 800 

TGTTGAGGTA TTACAAGAGT TCCCTCATCT TATCGACCCC CATACCGGAA 850 

AGTCTTTAAT GCACAGAACA TGTATTATTT GTAACACATC ATCCATGCCT 900 

55 GTGGCTGCCC GAGAGTCTTC GATCTATTTA GGAGTGACGA TTGCAGAATA 950 

CTATCGCCAG ATGGGACTAG ATATTCTGCT TTTAGCTGAT TCTACATCCC 1000 

GATGGGCACA AGCCCTTAGA GAGATTTCGG GACGTCTTGA AGAAATCCCT 1050 

GGAGAGGAAG CATTTCCTGC ATACCTGTCT TCTAGAATAG CTGCTTTTTA 1100 

TGAGCGAGGA GGAGCTATCA CCACGAAAGA TGGTTCTGAA GGATCTTTAA 1150 

60 CTATATGTGG TGCGGTGTCT CCTGCAGGAG GAAACTTTGA AGAACCAGTC 1200 

389 
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ACTCAATCTA CATTAGCTGT AGTCGGAGCG 
ACGAGCTGAC GCACGTAGGT ATCCTTCAAT 
CAAAATATTT GAACCAGGTA GGACAAATTT 
TGGGGTGGTG CTGTGAAAAA AGCAGCACAG 
5 AATCGGCAAG CGTATGGAAG TTGTCAGTGA 
ACATGGAAAT CTACTTAAAG GCAGAACTTT 
CAGAACGCAT TCGATCCTGT GGACTGTTAT 
AGAGTTATTT TCATTAATCA GTCGTATTTT 
ATAGTCCTGA TGATGCAAGA AGCTTTTTCC 
10 AAGACATTAA ATGGCCTGAA ATTTCTTTCA 
AGAGGTCATA GTTAGACTGT TGGAAAAAAC 
GATATGCAAA CAATCTACAC AAAAATAACT 



PCT/CAOO/01150 

TTCTGTGGTC TTTCAAAAGC 1250 

AGACCCTTTG ATTTCTTGGT 1300 

TAGAAGAGAA GGTTTCAGGC 1350 

TTTCTAGAGA AAGGTTCAGA 1400 

AGAAGGGGTT TCTATGGAAG 1450 

ATGATTTTTG TkATCTCCAG 1500 

TGTCCTTTTG AGAGACAGAT 1550 

TGATGCTAAA TTTGTTTTTG 1600 

TTGAGCTGCA GAGCAAGATT 1650 

GAGGAATATC ATGAGAGTAA 1700 

AATGGTACAA ATGGCGTAAG 1750 

G 1781 



15 



2) INFORMATION FOR SEQ ID NO: 687 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1758 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Halobacterium salinarum 
(C) ACCESSION NUMBER: S56356 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 687 

ATGAGTCAAG CTGAAGCAAT CACTGACACC GGCGAAATCG AGAGCGTGAG 50 

CGGTCCGGTC GTGACCGCCA CGGGCCTCGA CGCGCAGATG AACGACGTCG 100 

TCTACGTGGG CGACGAGGGT CTGATGGGCG AGGTCATCGA GATCGAAGGC 150 

35 GACGTAACCA CCATCCAGGT CTACGAGGAG ACGTCCGGCA TCGGGCCGGG 2 00 

CCAGCCCGTC GACAACACGG GCGAACCGCT CACCGTGGAC CTGGGCCCGG 250 

GGATGCTGGA CTCCATCTAC GATGGTGTGC AGCGTCCACT GGACGTCCTC 300 

GAAGACGAGA TGGGGGCGTT CCTCGATCGC GGTGTTGACG CACCCGGCAT 350 

CGACCTCGAC ACCGACTGGG AGTTCGAGCC CACCGTCGAG GCGGGCGACG 4 00 

40 AGGTCGCGGC CGGCGATGTC GTCGGCACCG TCGACGAAAC GGTCAGCATC 450 

GAACACAAGG TTCTGGTGCC CCCCCGCAGT GACGGCGGCG AAGTCGTCGC 500 

CGTCGAATCC GGCACGTTCA CCGTCGACGA CACGGTCGTC GAGTTGGACA 550 

CCGGCGAGGA GATCCAGATG CACCAGGAGT GGCCGGTCCG CCGCCAGCGC 600 

CCCACCGTCG ACAAGCAGAC GCCGACGGAG CCGCTGGTGT CCGGCCAGCG 650 

45 CATCCTCGAC GGCCTGTTCC CGATCGCGAA AGGCGGGACG GCCGCGATCC 700 

CGGGGCCGTT CGGGTCCGGG AAGACGGTCA CCCAGCAGTC CCTCGCGAAG 750 

TTCGCCGACG CGGACATCGT TGTCTACATC GGCTGTGGTG AGCGCGGCAA 800 

CGAGATGACG GAAGTCATCG AGGACTTCCC GGAGCTGCCC GACCCCCAGA 850 

CCGGGAACCC GCTGATGGCC CGCACCACGC TCATCGCGAA CACGTCGAAC 900 

50 ATGCCGGTTG CCGGGCGTGA GTCCTGCATT TACACGGGAA TCACCATCGC 950 

GGAGTACTAC CGCGACATGG GCTACGACGT GGCGCTGATG GCCGACTCCA 1000 

CCTCGCGGTG GGCGGAGGCC ATGCGGGAGA TCTCCTCGCG ACTCGAGGAG 1050 

ATGCCCGGCG AGGAGGGGTA TCCCGCGTAC CTGGCCGCCC GCCTCTCGGA 1100 

GTTCTACGAG CGCGCCGGCT ACTTCGAGAA CTTCAACGGG ACCGAGGGCT 1150 

55 CCATCTCGGT CATCGGTGCG GTGTCGCCGC CCGGCGGGGA CTTCTCCGAG 1200 

CCGGTCACCC AGAACACGCT GCGCATCGTG AAGACGTTCT GGGCGCTTGA 1250 

CTCGGACCTC GCCGAGCGCC GGCACTTCCC GGCGATCAAC TGGGACGAGT 1300 

CCTACAGCCT CTACAAGGAC CAACTCGACC CGTGGTTCAC GGACAACGTC 1350 

GTCGACGACT GGGCCGAGCA GCGCCAGTCG GCGGTCGACA TCCTCGACGA 1400 

60 GGAATCCGAA CTCGAAGAGA TCGTGCAGCT CGTCGGGAAG GACGCGCTGC 1450 

390 
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CCGAGGACCA GCAGCTCACG CTGGAAGTCG 
TGGCTCCAGC AGAACGCGCT CCACGACGTG 
GAAGACGTAC GCCATCCTCT CCGGCATCAA 
TTGAGGCGTT GGACGCCGGT GTGCCAGTCG 
5 GCCGCGCCGC GCCTGAATCG TCTCGGCACG 
GGCGGAGGTC GCGGAGATCA AACAGCAGAT 
TCTACTGA 



PCT/CA00/01150 

CGCGGTACAT CCGGSAGGCG L500 

GATCGCTACT GCCCGCCCGA 1550 

GACGCTTCAC GAGGAGTCCT 1600 

AGGAGATCAC GTCGATCGAC 1650 

ACGCCCGACG ACGAGCACGA 1700 

TACCGAGCAG CTTCGGGAGC 1750 

1758 



10 



2) INFORMATION FOR SEQ ID NO: 688 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3118 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Human 

(C) ACCESSION NUMBER: L09234 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 688 

GAATTCCGGC AGCTGACTAG TCTTGTGATT GGGGTCCTGG GCTGATAAAA 50 

TCATTCCAAA TGACGAGCAC ATTGATAAAG ACGTCCGATG AGGACCGGGA 100 

GTCCAAATTC GGCTTTGTTT TTGCCGTATC TGGACCTGTG GTGACAGCTG 150 

30 AACGAATGGC CGGTTCTGCT ATGTACGAAC TGGTGCGTGT CGGTTATTAT 200 

GAACTGGTCG GAGAGATCAT CCGGTTGGAG GGTGACATGG CAACAATCCA 250 

AGTATACGAA GACACCTCAG GTGTGACAGT AGGCGATCCC GTGCTGCGCA 300 

CAGGCAAGCC GCTGTCCGTG GAACTGGGAC CCGGAATCAT GGGCAGCATC 350 

TTCGACGGTA TCCAGCGACC GCTGAAGGAT ATCAATGAAC TGTCAAATAG 400 

35 TATCTACATC CCGAAAGGTG TCAATGTGCC TGCCCTGAGT CGCACTGCAC 450 

AGTGGGACTT CAGTCCCGTC AGTGTCAAGG TTGGAAGCCA CATTACTGGT 500 

GGTGACCTGT ACGGTTTGGT CCACGAAAAT ACTCTGGTGA AACACAAGTT 550 

GCTGCTGCCG CCCCGTGCCA AGGGAACTGT CACGTACATT GCAGAACCTG 600 

GAAACTACAC AGTTGATGAT GTTGTCCTGG AGACAGAATT TGACGGCGAG 650 

40 CGATCAAAGT TCACCATGCT GCAAGTGTGG CCTGTACGTC AGCCCAGGCC 700 

TGTTACAGAA AAGTTGCCAG CTAACTACCC CCTCCTTACT GGCCAGCGTG 750 

TGCTCGACTC CCTATTCCCG TGTGTCCAGG GTGGAACAAC AGCTATTCCT 800 

GGGGCCTTCG GATGTGGCAA GACTGTAATA TCACAGTCTT TGTCAAAATA 850 

CTCAAACTCC GATGTAATTA TCTATGTAGG TTGTGGTGAG CGAGGTAATG 900 

45 AAATGTCAGA AGTACTCAGG GATTTCCCGC AGTTGTCGTT GGAGATTGAT 950 

GGTGTGACTG AATCAATCAT GAAGAGAACA GCCCTGGTCG CAAACACATC 1000 

AAACATGCCT GTGGCTGCTC GAGAAGCATC TATCTACACA GGTATTACAC 1050 

TGTCAGAATA CTTCAGGGAC ATGGGTTACA ATGTATCCAT GATGGCTGAC 1100 

TCAACTTCAC GATGGGCCGA AGCTCTTCGA GAAATCTCAG GTCGATTGGC 1150 

50 TGAAATGCCT GCCGACAGCG GTTATCCCGC CTACCTAGGT GCACGACTTG 1200 

CCAGTTTCTA CGAGCGTGCC GGCCGTGTGA AGTGCTTGGG TAACCCAGAC 1250 

AGGGAGGGCT CCGTGAGTAT AGTGGGCGCC GTGTCGCCGC CCGGTGGAGA 1300 

CTTCTCAGAT CCCGTGACGA CGGCCACACT AGGTATCGTC CAGGTGTTCT 1350 

GGGGTCTCGA CAAGAAACTT GCCCAGCGAA AGCACTTCCC ATCCATCAAC 1400 

55 TGGCTCATCT CGTACAGTAA ATACATGCGT GCTCTGGATG ACTTCTACGA 1450 

CAAGAATTTC CCAGAGTTTG TCCCACTGCG TACAAAGGTG AAGGAGATTT 1500 

TGCAGGAGGA AGAAGACCTG TCTGAAATTG TGCAGTTGGT CGGTAAAGCT 1550 

TCATTGGCAG AAACTGACAA GATCACACTT GAGGTTGCCA AACTATTAAA 1600 

GGATGATTTC CTGCAACAGA ACAGCTATTC ACCATATGAC CGTTTCTGCC 1650 

60 CATTCTACAA GACAGTAGGA ATGCTGAAAA ATATGATTGC TTTCTACGAT 1700 

391 
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ATGTCTCGGC ATGCAGTTGA ATCTACTGCT CAGAGCGAGA ACAA&ATCAC 17 5ft 

TTGGAATGTT ATTAGAGATT CTATGGGCAA TATTCTGTAT CAGCTTTCCT 1800 

CCATGAAATT CAAGGATCCA GTCAAGGATG GAGAAGCGAA GATCAAGGCA 1850 

GACTTTGAGC AGCTTCATGA AGACATTCAG CAAGCCTTCA GGAACCTGGA 1900 

5 GGATTAAAGT GGTAGCTGCC AGTGGTTCTC TCGGTGCAGT TGTCACATTT 1950 

GGCAAGCTCT GTAGGGTTGC CGAGTGGCAT CGGTGCTAGA CACCTGAGCA 2000 

TTCCTTTGCC ACATAAAGAC TAAAGCAGGT GGAATTTCAG TTGTAAAAAG 2050 

CTGGTTCCAT TGGTGCTAAG ATTATGTTGT GCCCTTTTCT GCTTCTCACA 2100 

TTCCAACAGA GGAATTTACT TCCAGTTTTC TTCCATTTTC CTCCTCATTT 2150 

10 TAAGTGTCGG TACAGAGGCA ATAATCTGAT AACTCTGTAC CGTCACTTAC 2200 

AAGCAGGGAG AATTTGTAAT TATTACAAAT CCCATTATCT CTGTGCACCA 2250 

CAGCCTTGTA AATTCATTTG TCCCAGGACT CCCTCTTGTG TGTACGTGAG 2300 

ATTGCCGTCT GTATGTATGT ACACACCGTA CTGCAGTATT TGAAGTCAGT 2350 

CAGAAGGTGA ATTACACCAC TTACTCATTG TGTCACGTAG CAAGTGTGCA 2400 

15 AACTGCCATC CATTGTCCTA TTTATTCACA TAACTAGTTT TCTTTGCATT 2450 

TCCAGTGTTG CAAATTGTGT TTAGAAAATT ATGCCATCGA GACTGGTCGA 2500 

ACCTCACATT GTAACTCAGT ATTTACACAC ACGTTTACTT GCTACAGAAA 2550 

TGTAGAAAAA ATAATTGTTG TATATTGAAA GTACAAGTGA CAAAGTTGCA 2 600 

TTTAAAATGG TGAATGTATT TTATATTTCT TTTGTAGACA CAAGAGTTAA 2650 

2 0 TGCATTTTGC TTAATGGAGA TGTATGTAAA CCTAAAATAG CAGTTTGTGC 2700 

ACAAATTATG TATATGTGAA ATGGAGATGG TTTCTAATTT GCTGATTGAT 2750 

TGCCAGTATT AATTTAAACA ACTGTAGTTG TGGGATGTAG TGGGAAGATT 2800 

TTTTTTTTCC TATAAAATTG GTGGATGTAT GTGTCGGAGA TTTTGATTGT 2850 

ATGTGTAAAA TAGTGATCCC AGTAACTGTA AAGCTTTAGA ATACAGTTAC 2900 

25. TGACTGTATA GTTGTACAGG TGTTGTTACT TTTAAGAATT TATTGACACA 2 950 

AAGGTGAAAG TCTATTATTG TATTGTAATG TTTAAAGCAT TTAAGGTTTA 3000 

AAAATCCTAC TTCTGTGTAT AAATGTTACC ATTCCTCATA TAACATAACT 3050 

GTGTAGAAAT ACAGTCAACT TCATGTTCAT TAGCATTTCA CTGTTGTCAC 3100 

ATAAATTATG CCCGGAAT 3118 

3'0 



2) INFORMATION FOR SEQ ID NO: 689 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1836 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Plasmodium falciparum 
45 (B) STRAIN: 3D7 

(C) ACCESSION NUMBER: L08200 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68 9 

50 ATGACAAAAG TTGCTGTTGA AAAAGAGGAA CCAGGAGTTG TTTATAAGGT 50 

GGCTGGTTCA TTAGTTATTG CTGAGAATAT GAGTGGAACT CGTATGTACG 100 

AGTTAGCTAA AGTAGGATGG AATAAATTGG TTGGAGAAAT TATTAGATTA 150 

GAAGGGAATT ATGCATATAT ACAAGTTTAT GAAGATACTT CAGGTTTATC 200 

TGTAGGAGAC CCTGTTATAA AAACAGGAAA TGCTTTATCA GTCGAATTGG 250 

55 GTCCTGGTAT TTTAGATAAT ATTTATGATG GTATTCAAAG ACCATTAGAA 300 

AGAATAGCAA ATGTGTGTGG TGATGTATAT ATATATAAAG GTATTGATAT 350 

GACATCTTTA GATCATGATA AACAATGGCA ATTTTATGCT GATAAGAAAT 400 

TAAAATTAAA TGATATTGTT ACTGGTGGAG ATATCTTTGG ATTTGTTGAT 4 50 

GAAAATAAAT TATTTAAAGA ACACAAAATT ATGGCTCCAC CTAATGCTAA 500 

60 AGGGAGGCTT ACATATATTG CTCCAGATGG ATCATATACT TTAAAAGATA 550 

392 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

AAATATTTGA ATTAGAATAT CAAGGAAAAA AATATACATA T&3TTTATCT 600 

CATTTATGGC CTGTTCGTGA TCCTAGACCT GTTTTAGAAA AGGTAACAGG 650 

GGATACTTTA TTATTAACAG GGCAAAGAGT TTTAGATTCG TTATTTCCAA 700 

CAGTTCAAGG AGGTACTTGT GCTATTCCTG GTGCATTTGG TTGTGGAAAA 750 

5 ACTTGTGTTT CTCAGGCCTT ATCAAAATAT TCTAATAGTG AAGTTATTAT 800 

ATATGTAGGA TGTGGTGAAA GAGGTAATGA AATGGCTGAA ATTTTATCCG 850 

ACTTTCCTGA ATTAACTACT AAAGTAGATA ATGAAGATGT AGGTATTATG 900 

CAAAGAACGT GTTTAGTTGC TAATACTTCT AACATGCCTG TCGCTGCAAG 950 

AGAAGCTAGT ATTTATACAG GTATTACTTT ATGTGAATAT TTCCGTGATA 1000 

10 TGGGTTATAA TGCTACCATG ATGGCTGATA GTACAAGTAG ATGGGCAGAA 1050 

GCCTTAAGAG AAATTTCAGG ACGTTTAGCT GAAATGCCTG CAGATAGTGG 1100 

TTATCCAGCT TATTTAGGTG CTAGATTAGC TTCCTTTTAT GAACGTGCAG 1150 

GAAAAGTCAA ATGTATTGGT TCTCCATCTC GTATAGGATC CATTACAATT 1200 

GTGGGTGCTG TGTCTCCACC AGGTGGTGAT TTCTCTGACC CTGTAACTAC 1250 

15 AGCAACCATG TCTATTGTTC AAGCATTTTG GGGGTTAGAT AAAAAACTAG 1300 

CTCAAAGAAA ACATTTCCCT TCTGTTAATT GGTCTACATC CTTTTCAAAG 1350 

TATGTCAGAC AATTAGAACA ATACTTTGAT AATTTTGATC AAGATTTCTT 14 00 

ATC TTTAAG A CAAAAAATTA GTGATATTTT ACAACAAGAA AGTGAC TTGA 1450 

ATGATATTGT TCAACTAGTA GGAAAGGATT CATTATCAGA AGACCAAAAA 1500 

20 GTTGTTATGG AAGTAGCCAA AATTATTAGA GAAGATTTTC TTCAACAAAA 1550 

TGCATTTAGC GATTATGATT ATATGTGCCC ATTACAAAAA ACAGTTGGTA 1600 

TGATGAGAAT TATTTGCCAC TTTTATGCTC AATGCTTAAG AACATTACAA 1650 

GAATATGACT CAAGAGAAAG AAAAATTGGT TGGGGATCTA TATATAATAC 1700 

ATTAAGACCA ACTATAAATA AAATTACACA TATGAAATTT GAAAACCCAA 1750 

25 AAAATTCAGA TGAATATTTC AAAAAGTATT TTAAGGCACT TGAAGAAGAA 1800 

ATAACAGTAG GTTTAAGAAA CTTGATGGAA AAATGA 1836 



30 2) INFORMATION FOR SEQ ID NO: 690 

: (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3216 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(B) STRAIN: X2180-1A 

(C) ACCESSION NUMBER: J05409 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 690 

ATGGCTGGTG CAATTGAAAA CGCTCGTAAG GAAATAAAAA GAATCTCATT 50 

AGAAGACCAT GCTGAATCTG AATATGGTGC CATCTATTCT GTCTCTGGTC 100 

CGGTCGTCAT TGCTGAAAAT ATGATTGGTT GTGCCATGTA CGAATTGGTC 150 

50 AAGGTCGGTC ACGATAACCT GGTGGGTGAA GTCATTAGAA TTGACGGTGA 200 

CAAGGCCACC ATCCAAGTTT ACGAAGAAAC TGCAGGCCTT ACGGTCGGTG 250 

ACCCTGTTTT GAGAACAGGT AAGCCTCTGT CGGTAGAATT GGGTCCTGGT 300 

CTGATGGAAA CCATTTACGA TGGTATTCAA AGACCTTTGA AAGCCATTAA 350 

GGAAGAATCG CAATCGATTT ATATCCCAAG AGGTATTGAC ACTCCAGCTT 400 

55 TGGATAGGAC TATCAAGTGG CAATTTACTC CGGGAAAGTT TCAAGTCGGC 450 

GATCATATTT CCGGTGGTGA TATTTACGGT TCCGTTTTTG AGAATTCGCT 500 

AATTTCAAGC CATAAGATTC TTTTGCCACC AAGATCAAGA GGTACAATCA 550 

CTTGGATTGC TCCAGCTGGT GAGTACACTT TGGATGAGAA GATTTTGGAA 600 

GTTGAATTTG ATGGCAAGAA GTCTGATTTC ACTCTTTACC ATACTTGGCC 650 

60 TGTTCGTGTT CCAAGACCAG TTACTGAAAA GTTATCTGCT GACTATCCTT 700 
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TGTTAACAGG TCAAAGAGTT TTGGATGCTT TGTTTCCTTG TGTTCAAGGT 750 

GGTACGACAT GTATTCCAGG TGCTTTTGGT TGTGGTAAGA CCGTTATCTC 800 

TCAATCTTTG TCCAAGTACT CCAATTCTGA CGCCATTATC TATGTCGGGT 850 

GCTTTGCCAA GGGTACCAAT GTTTTAATGG CGGATGGGTC TATTGAATGT 900 

5 ATTGAAAACA TTGAGGTTGG TAATAAGGTC ATGGGTAAAG ATGGCAGACC 950 

TCGTGAGGTA ATTAAATTGC CCAGAGGAAG AGAAACTATG TACAGCGTCG 1000 

TGCAGAAAAG TCAGCACAGA GCCCACAAAA GTGACTCAAG TCGTGAAGTG 1050 

CCAGAATTAC TCAAGTTTAC GTGTAATGCG ACCCATGAGT TGGTTGTTAG 1100 

AACACCTCGT AGTGTCCGCC GTTTGTCTCG TACCATTAAG GGTGTCGAAT 1150 

10 ATTTTGAAGT TATTACTTTT GAGATGGGCC AAAAGAAAGC CCCCGACGGT 1200 

AGAATTGTTG AGCTTGTCAA GGAAGTTTCA AAGAGCTACC CAATATCTGA 1250 

GGGGCCTGAG AGAGCCAACG AATTAGTAGA ATCCTATAGA AAGGCTTCAA 1300 

ATAAAGCTTA TTTTGAGTGG ACTATTGAGG CCAGAGATCT TTCTCTGTTG 1350 

GGTTCCCATG TTCGTAAAGC TACCTACCAG ACTTACGCTC CAATTCTTTA 1400 

15 TGAGAATGAC CACTTTTTCG ACTACATGCA AAAAAGTAAG TTTCATCTCA 1450 

CCATTGAAGG TCCAAAAGTA CTTGCTTATT TACTTGGTTT ATGGATTGGT 1500 

GATGGATTGT CTGACAGGGC AACTTTTTCG GTTGATTCCA GAGATACTTC 1550 

TTTGATGGAA CGTGTTACTG AATATGCTGA AAAGTTGAAT TTGTGCGCCG 1600 

AGTATAAGGA CAGAAAAGAA CCACAAGTTG CCAAAACTGT TAATTTGTAC 1650 

20 TCTAAAGTTG TCAGAGGTAA TGGTATTCGC AATAATCTTA ATACTGAGAA 1700 

TCCATTATGG GACGCTATTG TTGGCTTAGG ATTCTTGAAG GACGGTGTCA 1750 

AAAATATTCC TTCTTTCTTG TCTACGGACA ATATCGGTAC TCGTGAAACA 1800 

TTTCTTGCTG GTCTAATTGA TTCTGATGGC TATGTTACTG ATGAGCATGG 1850 

TATTAAAGCA ACAATAAAGA CAATTCATAC TTCTGTCAGA GATGGTTTGG 1900 

25 TTTCCCTTGC TCGTTCTTTA GGCTTAGTAG TCTCGGTTAA CGCAGAACCT 1950 

GCTAAGGTTG ACATGAATGG CACCAAACAT AAAATTAGTT ATGCTATTTA 2000 

TATGTCTGGT GGAGATGTTT TGCTTAACGT TCTTTCGAAG TGTGCCGGCT 2050 

CTAAAAAATT CAGGCCTGCT CCCGCCGCTG CTTTTGCACG TGAGTGCCGC 2100 

GGATTTTATT TCGAGTTACA AGAATTGAAG GAAGACGATT ATTATGGGAT 2150 

30 TACTTTATCT GATGATTCTG ATCATCAGTT TTTGCTTGCC AACCAGGTTG 2200 

TCGTCCATAA TTGCGGAGAA AGAGGTAATG AAATGGCAGA AGTCTTGATG 2250 

GAATTCCCAG AGTTATATAC TGAAATGAGC GGTACTAAAG AACCAATTAT 2300 

GAAGCGTACT ACTTTGGTCG CTAATACATC TAACATGCCG GTTGCAGCCA 2350 

GAGAAGCTTC TATTTACACT GGTATCACTC TTGCAGAATA CTTCAGAGAT 2400 

35 CAAGGTAAAA ATGTTTCTAT GATTGCAGAC TCTTCTTCAA GATGGGCTGA 2450 

AGCTTTGAGA GAAATTTCTG GTCGTTTGGG TGAGATGCCT GCTGATCAAG 2500 

GTTTCCCAGC TTATTTGGGT GCTAAGTTGG CCTCCTTTTA CGAAAGAGCC 2550 

GGTAAAGCTG TTGCTTTAGG TTCCCCAGAT CGTACTGGTT CCGTTTCCAT 2600 

CGTTGCTGCC GTTTCGCCAG CCGGTGGTGA TTTCTCAGAT CCTGTTACTA 2650 

40 CTGCTACATT GGGTATCACT CAAGTCTTTT GGGGTTTAGA CAAGAAATTG 2700 

GCTCAAAGAA AGCATTTCCC ATCTATCAAC ACATCTGTTT CTTACTCCAA 2750 

ATACACTAAT GTCTTGAACA AGTTTTATGA TTCCAATTAC CCTGAATTTC 2800 

CTGTTTTAAG AGATCGTATG AAGGAAATTC TATCAAACGC TGAAGAATTA 2850 

GAACAAGTTG TTCAATTAGT TGGTAAATCG GCCTTGTCTG ATAGTGATAA 2 900 

45 GATTACTTTG GATGTTGCCA CTTTAATCAA GGAAGATTTC TTGCAACAAA 2950 

ATGGTTACTC CACTTATGAT GCTTTCTGTC CAATTTGGAA GACATTTGAT 3000 

ATGATGAGAG CCTTCATCTC GTATCATGAC GAAGCTCAAA AAGCTGTTGC 3050 

TAATGGTGCC AACTGGTCAA AACTAGCTGA CTCTACTGGT GACGTTAAGC 3100 

ATGCCGTTTC TTCATCTAAA TTTTTTGAAC CAAGCAGGGG TGAAAAGGAA 3150 

50 GTCCATGGCG AATTCGAAAA ATTGTTGAGC AC TATGC AAG AAAGATTTGC 3200 

TGAATCTACC GATTAA 3216 



55 2) INFORMATION FOR SEQ ID NO: 691 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1860 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS : Double 
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(D) TOPOLOGY: Linear 



PCT/CAOO/01150 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Schizosaccharomyces pombe 

(B) STRAIN: 972 h- 

(C) ACCESSION NUMBER: S47814 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 691 

ATGGCGGGAG GAATTGAACT GGCCAAGAAG GCTATCAGGA GCCTCAAAAA 50 
TTACGACGAG CATGAAAACC GATATGGATC TATTTTCAGC GTTTCTGGTC 100 
CTGTCGTTGT TGCAGCCAAT ATGCTTGGAT GTTCGATGTA CGAACTCGTT 150 
CGCGTTGGTC ATGAAGAACT AGTTGGTGAA GTAATTCGTA TCCATCAAGA 200 
TAAATGTACT ATTCAAGTAT ACGAAGAGAC GTCCGGTCTC ACTGTTGGTG 250 
ATCCTGTCCA ACGCACTGGA AAGCCATTAT CTGTTGAATT AGGTCCTGGT 3 00 

TTAGCTGAGA CTATTTATGA TGGTATCCAA CGTCCGTTAA AGCAAATTTT 350 
CGACAAATCT CAAAGTATTT ATATTCCTAG AGGTATTAAT ACAGAATCAC 400 
TTAATCGTGA GCATAAGTGG GATTTCACAC CAAATAAGGA TTTACGCATT 450 
GGCGATCATG TATCCGGTGG TGATGTTTTC GGTTCTGTAT TTGAAAACTC 500 
TCTTTTCAAT GATCATAAAA TTATGTTACC CCCTAGAGCC CGTGGTACCG 550 
TCACATATAT TGCTGAAGCT GGATCATACC ATGTTGATGA AAAACTTCTT 600 
GAAGTCGAGT TTAATGGCAA GAAACATTCT TTTAGTATGT TGCATACTTG 650 
GCCTGTCCGT GCTGCTCGTC CAGTTGCGGA CAACTTAACT GCTAATCAAC 700 
CTTTATTGAC TGGTCAACGT GTTTTGGATG CGTTATACCC CTGTGTTCAA 750 
GGTGGCACTA CTGCTATCCC CGGTGCCTTT GGTTGTGGTA AAACAGTTAT 800 
TTCACAATCT CTTTCTAAGT ACTCTAATTC TGATTTGATT GTTTACGTCG 850 
GTTGTGGTGA ACGTGGAAAC GAAATGGCAG AAGTGTTAAT GGATTTCCCA 900 
GAACTAACAA TTGATATTAA TGGTAAACCA GAGCCCATTA TGAAGCGTAC 950 

TACATTGGTA GCCAACACTT CTAACATGCC TGTCGCTGCT CGTGAAGCTT 1000 

CCATTTATAC CGGTATTACA CTTGCTGAAT ATTATCGTGA TCAAGGTAAG 1050 

AACGTTTCAA TGATGGCTGA TTCTACATCT CGTTGGGCTG AAGCTTTGCG 11,00 

TGAAATTTCT GGTCGTTTGG CTGAGATGCC TGCCGATTCT GGTTATCCCG 1150 

CTTATTTGGG TGCCAAATTG GCTTCTTTTT ACGAACGTGC TGGTCGTGCT 1200 

CGTTGCTTGG GAAGTCCTGA CCGTGAAGGA ACAGTTTCAA TTGTTGGAGC 1250 

TGTTTCTCCT CCGGGTGGTG ATTTTTCTGA TCCTGTTACT AGTGCAACCT 1300 

TGGGAATTGT TCAAGTCTTC TGGGGTTTGG ACAAGAAATT GGCCCAACGT 1350 

AAACACTTTC CCTCAATCAA CACCTCTCTT TCCTATTCTA AATACATCAA 14 00 

TGCTTTGCAA CCTTGGTATG AGGAAAGAGT TCCAGGCTTT AATACTCTTC 1450 

GTGATCAAAT CAAACAGATC ATTCAACAAG AAGATTCCAT GTTGGAAATT 1500 

ATTCAGTTGG TTGGTAAGTC GGCTCTTTCT GAAACGGATA AAGTTACTTT 1550 

GGACATAGCC GGTATTATTA AGAATGACTT CTTACAACAA AACGGTTATT 1600 

CTGATTACGA TCGCTGTTGC CCTCTTTACA AGACTTATCA TATGATGCGA 1650 

AACATGATTG CTTACTACAC AAAGGCTAAA AGTGCCGTTG AAACTGGTAG 1700 

CGTTCCTTGG TCAAAGATTA AAGAAAGTAC TTCAGATATC TTTTATGAGT 1750 

TAACCTCGAT GAAATTCGAA AACCCTAATG AAGGCGAGAA GGAAATAGTC 1800 

GAACACTATG AAACTCTGCA CAAGAAGATT GAGGACAAGT TTCACACTCT 1850 

GACTGAGTAA 1860 



2) INFORMATION FOR SEQ ID NO: 692 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma congolense 
5 (B) STRAIN: IL3000 

(C) ACCESSION NUMBER: Z25814 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 692 

10 ATGACGAGCG ATAAAAACCC TTACAAAACA GAGCAGCGCA TGGGGGCCGT 50 

GAAGGCCGTC TCCGGGCCAG TTGTCATTGC TGAAAACATG GGCGGTAGCG 100 

CTATGTATGA GCTTGTGCAG GTAGGTTCCT TCCGGTTAGT GGGCGAGATC 150 

ATTCGTCTAG AGGGCGATAC CGCCACTATT CAGGTCTATG AGGAAACAGG 200 

TGGCCTCACT GTCGGAGACC CGGTGTACTG TACGGGTAAG .CCTCTTTCGC 250 

15 TTGAGCTTGG ACCTGGAATC ATGTCTGAAA TATTTGACGG TATCCAGCGG 3 00 

CCTCTTGACA CCATCTACCG CATGGTGGAA AACGTGTTTA TCCCCAGGGG 350 

CGTTCAGGTG AAGTCACTCA ATGACCAGAA ACAGTGGGAC TTTAAGCCAT 4 00 

GCCTGAAGGT TGGAGATCTT GTGTCTGGTG GTGATATCAT TGGCTCAGTG 450 

GTGGAGAACT CTCTCATGTA CAATCACAGC ATTATGATTC CGCCCAATGT 500 

20 GCGGGGCCGT GTTACTTCCA TTGTTCCTTC AGGAAATTAC ACCCTCCAAG 550 

ATGACATTAT TGAATTGGAA TATAATGGGA CAGTGAAATC ACTAAAACTT 600 

ATGCATCGCT GGCCAGTACG GACCCCGCGT CCTGTGGCGT CAAAAGAATC 650 

CGGCAATCAT CCGCTTCTCA CCGGACAGCG TGTGCTCGAT GCTCTCTTTC 700 

CATCCGTCCA GGGTGGAACA TGCGCCATCC CTGGCGCGTT TGGATGCGGA 750 

25 AAGACGGTTA TCAGTCAGGC TCTTTCGAAG TTCTCCAACA GCGACGCTGT 800 

TATCTATGTC GGCTGCGGCG AGCGTGGGAA TGAGATGGCA GAGGTGCTCA 850 

TGGACTTCCC CACACTCACC ACCGTTATTG ATGGTCGTGA GGAGTCCATC 900 

ATGAAGCGTA CCTGCCTGGT GGCAAACACC TCAAATATGC CTGTCGCTGC * 950 

TCGTGAGGCG TCTATTTACA CTGGCATCAC TTTAGCTGAG TATTATCGTG 1000 

30 ATATGGGCAA GCACATTGCT ATGATGGCCG ACTCTACCTC TCGATGGGCT 1050 

GAGGCTCTCC GTGAGATCTC TGGGCGTCTC GCTGAAATGC CCGCTGATGG , 1100 

TGGTTACCCT GCGTACCTCA . GTGCGCGTCT TGCTTCCTTC TACGAGCGTG .1150 

CGGGGCGCGT GACATGCATC GGTGGGCCAA AACGCGAGGG CTCAGTAACC 1200 

ATCGTTGGTG CCGTTTCTCC TCCTGGAGGT GACTTTTCTG ACCCAGTGAC 1250 

35 GTCCGCTACG CTTGGTATTG TGCAAGTCTT TTGGGGTCTT GAGAAGCGTC 13 00 

TTGCGCAACG TAAACACTTT CCTTCTGTTA ATTGGCTCAT TTCCTATTCA 1350 

AAATACCTTA ATGCTTTGGA GCCCTTCTTC AACACGCTTG ACCCTGACTA 1400 

CATGCGCCTG CGGTCAGTTG CTGCGGAGAT CCTTCAGCGT GAGGAAGAGT 1450 

TGCAAGAAAT TGTTCAACTT GTCGGTAAGG ACTCACTTTC GGAGTCTGAC 1500 

40 AAAATTATTC TAGAAACGGC TAAGGTTATT CGTGAAGAGT TTCTCCAGCA 1550 

GAATGCCTTT ACGCCGTACG ACAAGTATTG CCCGCCGTAC AAGACCTGCT 1600 

GGATGCTACG TAACATTGTC GCGTTCTACG AGGAGAGCCA GCGCGTTGTA 1650 

GCTGAGTCCG CTGGGGAACT TAAGATTACG TGGAACTACA TTCGTGAAAT 1700 

GATTCCTCAT ATTTACACGG GTTTAACTGA GATGAAGTTC CGTGATCCTC 1750 

45 AGGAGGGTGA GGAGGCCAAC GTAGAATTCT ACAGAAAACA AAATGAGGAA 1800 

ATTGTCAGCG CATTCGCCTC GCTGCTGCAA TAA 1833 



50 2) INFORMATION FOR SEQ ID NO: 693 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1758 ' bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Thermus thexwophilus 

(B) STRAIN: HB8 

(C) ACCESSION NUMBER: D63799 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 693 

ATGATCCAAG GGGTGATCCA GAAGATCGCG GGCCCGGCGG TGATCGCCAA 50 

GGGCATGCTC GGGGCCCGCA TGTACGACAT CTGCAAGGTG GGCGAAGAGG 100 

GCCTCGTGGG CGAGATCATC CGCCTGGACG GGGACACGGC CTTCGTCCAG 150 

10 GTCTACGAGG ACACCTCGGG CCTAAAGGTG GGGGAGCCCG TGGTCTCCAC 200 

GGGCCTTCCC TTGGCGGTGG AGCTCGGCCC CGGGATGCTG AACGGCATCT 250 

ACGACGGCAT CCAGCGCCCC CTGGAGCGCA TCCGGGAGAA GACGGGGATC 300 

TACATCACCC GGGGCGTGGT GGTCCACGCC CTGGACCGGG AGAAGAAGTG 350 

GGCCTGGACG CCCATGGTCA AGCCCGGGGA CGAGGTGCGG GGGGGTATGG 400 

15 TCCTGGGCAC GGTGCCCGAG TTCGGCTTCA CCCACAAGAT CCTGGTACCC 450 

CCGGACGTGC GGGGCCGGGT CAAGGAGGTG AAGCCCGCCG GGGAGTACAC 500 

CGTGGAGGAG CCGGTGGTGG TCCTCGAGGA CGGCACCGAG CTCAAGATGT 550 

ACCACACCTG GCCCGTTCGC CGGGCGAGGC CCGTGCAAAG GAAGCTTGAC 600 

CCCAACACCC CCTTCCTCAC GGGGATGCGC ATCCTGGACG TCCTCTTCCC 650 

20 CGTGGCCATG GGGGGCACCG CCGCCATCCC TGGGCCCTTC GGCAGCGGCA 700 

AGACCGTGAC CCAGCAGTCC CTGGCCAAGT GGTCCAACGC CGACGTGGTG 750 

GTCTACGTGG GCTGCGGGGA GCGGGGGAAC GAGATGACCG ACGTGCTCGT 800 

GGAGTTCCCC GAGCTCACCG ACCCCAAGAC GGGTGGGCCC TTGATGCACC 850 

GCACCGTCCT CATCGCCAAC ACCTCCAACA TGCCCGTGGC CGCCCGCGAG 900 

25 GCCAGCATCT ACGTGGGCGT GACCATCGCC GAGTACTTCC GCGACCAGGG 950 

CTTCTCCGTG GCCCTCATGG CCGACTCCAC GAGCCGCTGG GCCGAGGCTT 1000 

TGCGCGAGAT CTCTAGCCGC CTCGAGGAGA TGCCCGCCGA GGAGGGCTAC 1050 

' CCGCCCTACC TCGCCGCCAG GCTCGCCGCC TTCTACGAGC GGGCGGGCAA 1100 

GGTCATCACC CTGGGCGGCG AGGAGGGGGC GGTGACCATC GTGGGGGCCG 1150 

30 TCTCCCCGCC GGGCGGCGAC ATGTCCGAGC CCGTGACCCA GTCCACCTTG 1200 

. AGGATCGTGG GGGCCTTCTG GCGGCTTGAC GCCTCCCTGG CCTTCCGCCG 1250 

CCACTTCCCC GCCATCAACT GGAACGGCTC CTACAGCCTC TTCACCTCCG 1300. 

CCCTTGACCG e CTGGTACCGG GAGAACGTGG CCGAGGACTA CCCCGAGCTC 1350 

CGCGACGCCA TCTCCGAGCT TTTGCAGCGG GAGGCGGGCC TCCAGGAGAT 1400 

35 CGTCCAGCTC GTGGGGCCGG ACGCCCTCCA GGACGCCGAG CGCCTCGTCA 1450 

TTGAGGTGGG CCGGATCATC CGCGAGGACT TCCTGCAGCA GAACGCCTAC 1500 

CACGAGGTGG ACGCCTACTG CTCCATGAAG AAGGCCTACG GGATCATGAA 1550 

GATGATCCTC GCCTTCTACA AGGAGGCGGA GGCGGCCATC AAGCGGGGGG 1600 

TTTCCATAGA CGAGATCCTG CAGCTCCCCG TTCTGGAGCG CATCGGCCGC 1650 

40 GCCCGCTACG TGAGCGAGGA GGAGTTCCCC GCCTACTTTG AGGAGGCCAT 1700 

GAAGGAGATC CAGGGGGCCT TCAAGGCTGG CCTAAAGGGG GAGAGATGGA 1750 

CCTTCTGA 1758 



45 



55 



60 



2) INFORMATION FOR SEQ ID NO: 694 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 694 
CGGCGCIATC YTSGTTGTTG C 21 
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2) INFORMATION FOR SEQ ID NO: 695 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 695 
GTTTCACGTG ATGACGTACA 



2 ) INFORMATION FOR SEQ ID NO: 696 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 696 
ATIGGICAYR TIGAYCAYGG IAARAC 



2) INFORMATION FOR SEQ ID NO: 697 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 697 
CCIACIGTIC KICCRCCYTC RCG 



2) INFORMATION FOR SEQ ID NO: 698 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1185 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) . TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Escherichia coli 

(C) ACCESSION NUMBER: extracted from J01690 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 698 

5 

GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTTA ACGTTGGTAC 50 

TATCGGCCAC GTTGACCACG GTAAAACTAC TCTGACCGCT GCAATCACCA 100 

CCGTACTGGC TAAAACCTAC GGCGGTGCTG CTCGTGCATT CGACCAGATC 150 

GATAACGCGC CGGAAGAAAA AGCTCGTGGT ATCACCATCA ACACTTCTCA 200 

10 CGTTGAATAC GACACCCCGA CCCGTCACTA CGCACACGTA GACTGCCCGG 250 

GGCACGCCGA CTATGTTAAA AACATGATCA CCGGTGCTGC TCAGATGGAC 3 00 

GGCGCGATCC TGGTAGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG 350 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 400 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 450 

15 GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGACTTCC CGGGCGACGA 500 

CACTCCGATC GTT CGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG 550 

AGTGGGAAGC GAAAATCCTG GAACTGGCTG GCTTCCTGGA TTCTTATATT 600 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 650 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC 700 

20 GCGGTATCAT CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAG 750 

ACTCAGAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 800 

CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC 850 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAG 900 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG 950 

25 CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 1000 

GTACTACTGA CGTGACTGGT ACCATCGAAC TGCCGGAAGG CGTAGAGATG 1050 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 1100 

CGCGATGGAC GACGGTCTGC GTTTCGCAAT CCGTGAAGGC GGCCGTACCG 1150 

TTGGCGCGGG CGTTGTTGCT AAAGTTCTGG GCTAA 1185 

30 

2) INFORMATION FOR SEQ ID NO: 699 . 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
"(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 699 
45 GTIACIGGYT CYTYRARRTT ICCICC 26 



50 



2) INFORMATION FOR SEQ ID NO: 700 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 700 

60 
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2) INFORMATION FOR SEQ ID NO: 701 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 701 
GTGTTCACGA TCATCGATGC G 



2) INFORMATION FOR SEQ ID NO: 702 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 702 
CTCTCGATAT CCGCGAAGCG 



2) INFORMATION FOR SEQ ID NO: 703 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 703 
TATGGAAATT CGAAACATCT 



2) INFORMATION FOR SEQ ID NO: 704 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

400 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 704 
AGTGCTCCAA TTAATGTTGG 



2 ) INFORMATION FOR SEQ ID NO: 705 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 705 
GTACAGTTCC AATACCTGAA 



2) INFORMATION FOR SEQ ID NO: 706 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 706 
TGAAATCTTC ACATCCAACA 



2) INFORMATION FOR SEQ ID NO: 707 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 707 
TWACCATTTC AGTACCTTCT GGTAA 



2) INFORMATION FOR SEQ ID NO: 708 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 

401 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 708 
TCRTCCATIC CIARIATIGC I AT I AT 



2) INFORMATION FOR SEQ ID NO: 709 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1656 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) , ORGANISM: Borrelia burgdorferi 

(C) ' ACCESSION NUMBER: extracted from AE001122 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 709 

ATGAATGAAG TTTTATTTGT AAAGACTGCT GGTAGGAATT TAAAAGCAGA 50 

AGTAATTCGT ATTAGGGGCA ATGAAGTTGA TGCACAGGTT TTTGAATTGA 100 

CAAAAGGGAT ATCTGTTGGA GACCTAGTTG AATTTACAGA CAAACTTTTA 150 

ACAGTTGAAC TCGGACCAGG GCTTTTAACT CAAGTATATG ATGGGCTTCA 2 00 

AAATCCTTTG CCTGAATTGG CTATTCAATG TGGATTTTTT TTAGAAAGGG 250 

GAGTATATTT AAGGCCCTTG AATAAAGATA AAAAGTGGAA TTTTAAAAAA 3 00 

ACCTCCAAAG TTGGAGATAT CGTTATTGCA GGAGATTTTT TAGGTTTTGT 350 

AATTGAGGGA ACTGTTCACC AT CAAATAAT GATTCCATTT TATAAAAGGG 400 

ATTCTTATAA AATTGTGGAG ATTGTAAGTG ATGGCGACTA TTCGATTGAT 450 

GAGCAAATTG CTGTAATTGA AGATGATTCT GGTATGAGGC ATAATATTAC 500 

AATGTCTTTT CATTGGCCTG TTAAAGTTCC TATTACTAAT TATAAGGAAC 550 

GCCTTATTCC TAGTGAACCT ATGTTGACTC AAACTAGAAT TATAGATACA 600 

TTTTTCCCAG TTGCCAAAGG TGGAACTTTT TGCATTCCGG GTCCTTTTGG 650 

AGCAGGAAAA ACGGTTCTTC AGCAGGTTAC AAGTCGAAAT GCTGATGTTG 700 

ATGTAGTGAT TATTGCAGCT TGTGGTGAGC GAGCAGGAGA AGTGGTAGAA 750 

ACTCTTAAAG AATTTCCCGA ATTAATGGAT CCAAAAACCG GCAAATCTTT 800 

AATGGACAGG ACTTGTATTA TTTGTAATAC ATCTTCAATG CCAGTTGCAG 850 

CTAGAGAAGC TTCTGTTTAT ACTGCTATTA CTATTGGTGA GTATTACAGG 900 

CAAATGGGCC TTGATATTCT TCTTTTGGCA GATTCAACTT CAAGATGGGC 950 

TCAAGCAATG AGAGAAATGT CTGGACGCCT TGAGGAAATT CCTGGCGAGG 1000 

AGGCTTTTCC GGCATATCTT GAGTCTGTTA TTGCTTCCTT TTATGAAAGG 1050 

GCAGGTATTG TAGTTCTTAA TAATGGGGAT ATTGGATCTG TAACAGTTGG 1100 

TGGCTCTGTA AGTCCTGCTG GTGGTAATTT TGAAGAGCCA GTTACTCAAG 1150 

CAACTTTAAA AGTTGTAGGA GCATTTCACG GGCTTACAAG AGAAAGGTCT 1200 

GATGCTAGGA AATTTCCAGC TATTAGTCCT CTTGAATCTT GGAGTAAATA 1250 

TAAAGGCGTT ATTGATCAAA AAAAGACTGA ATATGCAAGA TCTTTTTTGG 1300 

TGAAAGGTAA TGAAATTAAT CAAATGATGA AAGTTGTTGG AGAAGAAGGC 1350 

ATAAGTAACG ATGATTTTTT AATTTATTTA AAATCCGAGC TACTTGATTC 1400 

GTGCTATTTG CAGCAAAATT CATTTGATTC TATTGATGCT GCTGTTAGTT 1450 

CAGAGCGTCA AAATTATATG TTTGATATAG TTTATAACAT TCTTAAAACT 1500 

AACTTTGAGT TTTCTGATAA ACTTCAAGCA AGAGATTTTA TAAATGAGTT 1550 

AAGGCAAAAT CTTTTAGACA TGAATCTTTC TTCTTTTAAG GATCATAAGT 1600 
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TTAATAAATT GGAGCATGCT TTGGGTGAAT TGATAAATTT TAAAAAGbTA 1650 
ATTTAG 1656 



2) INFORMATION FOR SEQ ID NO: 710 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1818 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Treponema pallidum 

(B) STRAIN: Nichols 

(C) ACCESSION NUMBER: extracted from AE000520 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 710 



GTGATCAAAG ACGATGTGGT TACAGGCCGT GTAGTGAGGG TGTCTGGTCC 50 

CATTGTGTAT GCCGAGGGCC TCTCTGCGTG CAGCGTATAC GATGTTGTCG 100 

25 ACGTAGGGGA AGCATCGCTC ATCGGAGAAA TTATCCGGTT GGATGAGAGC 150 

AAGGCGGTCG TGCAAGTATA CGAGGATGAC ACAGGTATGC GAGTCGGGGA 2 00 

GAAGGTGACA AGCTTGCGTC GACCACTCTC AGTCCGCTTA GGGCCTGGAT 250 

TAATCGGCAC CATTTATGAC GGTATTCAGC GCCCACTTGA GCGCCTCTTC 3 00 

CAAGAAGACG GCGCCTTCTT GCGTCCTGGT GCGCGTTCAC AACCGCTTGA 350 

30 TGGCTCCGTA CGCTGGGATT TTCGTCCTCA TTGTAACGAG CGCGGTGAGG 4 00 

CCCTGTGCGC GGGGATTCCG ATTGCACCTG GGTCAGTGTT AGGGACCGTG 450 

CAGGAGACTC CTTCTGTTGT GCACACTATC ATGGTTCCTC CTGACATCCG 500 

GGGGAGCGTG CTATCTTCGT TCAAGGGCGC AGGTGCTTAC ACAATAGATG, 550 

AAGAAATTGG ACGCACTGAT CTTGGTGAGC CGCTTTTTCT ATCCCAGTAC 600 

35 TGGCCAGTGC GTCGTGCGCG TCCTTTCAGC AAAAAACTTG CAGTGTGTGA 650 

GCCACTAGTT ACTGGACAGC GGGCGATTGA TGTTTTCTTC CCCCTATCAA 700 

AGGGAGGAAC GGCGGCTATT CCAGGGGGAT TTGGAACTGG GAAGACAATG 750 

ACGCAGCATG CCGTTGCCAA GTGGTGTGAT GCAGATATTA TCGTGTACAT 800 

CGGCTGCGGA GAGCGGGGCA AC GAGATG AC AGACGTGCTC TCTGAATTTC 850 

40 CCAAACTCAT CGATCCGCGC ACAGGACGCT CTCTTATGGA GCGGACGATT 900 

TTGATCGCAA ATACGTCCAA TATGCCTGTG TCCGCACGCG AGGTGTCGCT 950 

GTATTCAGGG ATTACCCTTG CGGAATACTA CCGTGATATG GGTATGCATG 1000 

TGGCCATCAT GGCTGATTCT ACCAGCCGCT GGGCGGAGGC GCTGCGTGAA 1050 

TTGTCTGGGC GCATGGAAGA AATGCCTGCG GAGGAGGGAT TCCCTGCGTA 1100 

45 CCTTCCGACG CGTCTTGCAG AATTTTATGA GCGCGCAGGA CGCGTGGAAA 1150 

CCTGTGTGGC GCGCGAGGGC TCTGTGAGCA TCATTGGTGC TGTTTCTCCC 1200 

CTGGGTGGAG ATTTCTCTGA GCCGGTGACG CAGCACACAA AGCGCTTCAT 1250 

CCGTTGCTTT TGGGCCTTGG ATCGTGAACT TGCACACGCG CGTCATTACC 13 00 

CTGCCATTGG GTGGATAGAT TCATACTCTG AATATGCGCA GGAAGTAAGT 1350 

50 GCATGGTGGA GTAAGTATGA CCCGCGCGCA GGCGCGTTGC GCGCCGCAGC 1400 

CTTGGATTTG CTGAGAAAGG AACAGCGGTT ACAGCAAATT GTCAGGCTTG 1450 

TCGGTCCTGA TGCGCTGCCT GGAGAAGATC GTCTGGTGCT AATGGTGTGT 1500 

GAAATGATCA AAGGTGGCTT TCTGCAGCAG AACGCTTTTG ATCCGACGGA 1550 

TGTGTTCTCC TGTCCCGAAA AGCAGGTGCA GATCTTGCGT ACCATAGTGG 1600 

55 ATTTTCACGA ACGTGCCGTG GTGCTGCTGC GTGCAGGTAT TTCGCTTTCT 1650 

GCGCTGTCCC AGCTTTCGTG CCGGGAGCTC ATCGTACGTA TGAAAACTAC 1700 

GTACGGGAAT GAGGATGTAC ACAAGATGCA GAAAGTGTAC GACACGATGT 1750 

GCACTGAGTT TGACCAACTG AGTGTGTGTG CTGCCGCGCG CACACAAGGG 1800 

GGGGAGAAAG TCGAATGA 1818 

60 
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2 ) INFORMATION FOR SEQ ID NO: 711 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1779 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia trachomatis 
15 (B) STRAIN: MoPn 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 711 

CAGGGCTATG TCGTAGAAGC TTACGGAAAT TTATTGCGGG TGCATTTTGA 50 

20 TGGGCATGTG CGTCAAGGAG AAGTGGCCTA TGTCAGCGTG GATGATACTT 100 

GGTTGAAAGC GGAAATTATA GAAGTTGTGG GAGATGAGGT TAAAGTCCAA 150 

GTTTTTGAGG AAACTCAAGG AATTTCTCGA GGCGCTTTGG TAACTTTTTC 200 

CGGGCATTTA TTAGAAGCGG AACTTGGGCC CGGTCTATTG CAAGGTATTT 250 

TTGACGGACT TCAGAATCGC TTAGAGGTAT TGGCAGATAC AAGCTTGTTC 300 

25 TTGAAAAGAG GGGAGTATGT TAATGCCATT TGTCGGGAAA CTGTATGGGC 350 

TTATACGCAA AAGGCTTCTG TCGGGGATGT TCTATCTCGG GGAGATGTGC 400 

TTGGTACAGT AAAGGAAGGG CGGTTTGATC ATAAAATCAT GGTTCCTTTC 450 

TCTTGTTTTG AGGAAGTGAC TATCACTTGG GTCATTTCTT CAGGAGATTA 500 

CACTGTTGAT ACCGTTATTG CTAAAGGACG TACTGCTTCA GGAGCCGAGC 550 

3 0 TTGAATTTAC AATGGTTCAG AAATGGCCCA TTAAACAGGC TTTTTTAGAA 600 

GGGGAAAAGG TACCGTCTCA TGAAATTATG GATGTTGGGT TACGAGTATT 650 

AGATACTCAG ATCCCCGTCT TAAAGGGAGG AACTTTTTGT ACTCCAGGGC 700 

CTTTTGGTGC AGGAAAGACC GTTTTACAGC ACCATTTATC TAAGTATGCA 750 

GCTGTAGATA TCGTAGTTTT GTGTGCTTGT GGAGAGCGAG CTGGAGAGGT 800 

35 TGTAGAAATT CTTCAGGAGT TCCCGCATTT GACAGATCCT CATACGGGGC 850 

AGTCTTTGAT GCATAGGACC TGTATTATTT GTAATACATC TTCCATGCCT 900 

GTAGCAGCTA GAGAGT CCTC CATTTATTTG GGTATTACTA TAGCAGAATA 950 

TTACCGTCAA ATGGGGTTGC ATGTTTTGTT ATTGGCTGAC TCGACATCTA 1000 

GATGGGCTCA AGCTTTAAGG GAAATTTCAG GGCGATTAGA AGAAATCCCT 1050 

40 GGAGAAGAAG CTTTCCCAGC CTATTTGGCG TCTCGAATAG CAGCTTTTTA 1100 

TGAGCGAGGC GGGGCTGTGA AAATGAAAGA TGGATCGGAA GGCTCCTTGA 1150 

CTATCTGTGG AGCGGTTTCT CCCGCAGGAG GAAATTTTGA AGAGCCTGTT 12 00 

ACACAAGCAA CTTTATCTGT TGTTGGGGCT TTCTGTGGGC TTTCTAAGGC 1250 

TAGAGCAGAT GCTAGACGGT ATCCTTCTAT TGATCCGATG ATTTCATGGT 1300 

45 CTAAGTACTT GGATTCTGTG GCGGAGATTT TAGAGAAAAA AGTTCCAGGA 1350 

TGGGGAGATT CCGTTAAAAA AGCTTCTCGT TTCTTAGAAG AAGGAGCAGA 1400 

AATTGGTAAG CGAATAGAAG TTGTTGGGGA AGAAGGGATT TCTATGGAAG 1450 

ATATAGAAAT CTTTTTGAAA TCAGAGTTGT ATGATTTCTG TTACTTACAG 1500 

CAAAACGCTT TCGATGCAGA GGACTGTTAT TGTCCTTTTG ATCGTCAAAT 1550 

50 AGAGCTTTTT TCTTTAATGA GTCATATTTT TAGCTCTAGA TTCTGTTTTG 1600 

ATTGTCCAGA TAATGCTCGG AGTTTCTTTT TAGAGCTTCA AAGTAAAATT 1650 

AAAACGCTGA ATGGTCAAAA ATTCCTTTCT GAAGACTATC AGAAGGGGCT 1700 

AGAAGTGATC TATAAACTAT TAGAAAGCAA AATGGTGCAG ACGGCGTAGG 1750 

TATGCAAACA ATATATACAA GAATTACGG 1779 

55 



2) INFORMATION FOR SEQ ID NO: 712 

60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 965 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecal is 
10 (B) STRAIN: V583 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 712 



GTGCAAATTG GAAAAATTGT CAAAGTTTCA GGTCCTTTGA TTTTAGCTGA 50 

15 AAACATGTCA GATGC TAGT A TCCAAGACAT TTGTCATGTA GGAGATTTAG 100 

GCGTTATCGG AGAGATTATT GAAATGCGAG GCGACGTCGC TTCGATTCAA 150 

GTATATGAAG AAACAACAGG CATTGGACCA GGAGAACCAG TTATTTCAAC 200 

AGGAGAACCA TTATCTGTTG AATTAGCCCC AGGTTTAATT GCCGAAATGT 250 

TTGATGGTAT TCAACGACCA TTGGATACAT TTCAAGAAGT AACCCACAGT 300 

20 AACTTTTTAG GCCGTGGCGT TAAAATTGAT GCGTTAGATC GTGAGAAAAA 350 

ATGGACGTTT GAACCAACTG TGGCAGTTGG TGAAGAAGTG TCGGCAGGTG 400 

ACATCGTCGG TGTGGTTCAA GAAACACCGA TTATTCAACA TAAAATTATG 450 

GTGCCTTTCG GCGTTTCAGG AACGATTGCC GAAATTAAAG CAGGTGACTT 500 

TGCCATTGAT GAAACAGTTT ACTCAGTGGA AACGGCTAAA GGAACGGAAA 550 

25 GTTTTAGCAT GATGCAAAAA TGGCCCGTTC GGCGGGGACG TCCCATTTTA 600 

GAAAAACTAA GTCCCAAAGT ACCGATGGTG ACCGGACAAC GCGTAATTGA 650 

TACCTTTTTC CCAATTACGA AAGGCGGAGC GGCAGCAGTT CCAGGACCAT 700 

TTGGCGCTGG AAAAACAGTC GTTCAGCACC AAATTGCTAA GTGGGCCGAT 750 

GTCGACTTAG TCGTTTACGT TGGTTGTGGG GAACGCGGGA ATGAAATGAC 800 

30 AGATGTTTTA AATGAATTTC CAGAATTAAT TGACCCAACA ACTGGTGAGT 850 

CTTTGATGAA TCGGACGATT TTAATTGCGA ATACGTCAAA TATGCCGGTA 900 

GCGGCACGGG AAGCCTCGAT TTATACAGGG ATTACCATTG CAGAATATTT 950 

CCGTGATATG GGTTA ■ 965 

35 



2 ) INFORMATION FOR SEQ ID NO: 713 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 1737 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Methanosarcina barker! 

(C) ACCESSION NUMBER: extracted from J04836 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 713 



GTGGAAGTAA AAGGTGAAAT 
CATCGGCTTG CAGGCAAAAA 

55 GTTTAATGGG TGAAGTCATT 
GTATATGAAG AGACCGCAGG 
AGGGTCGTCT CTGTCCGTAG 
ATGACGGGGT TCAAAGGCCT 
TTCATCCAGA GAGGTGTCAG 

60 GGATTTCAAA CCCATTGTCA 



TTATCGTGTG TCTGGGCCTG 
TGTATGACCT GGTCAAAGTC 
CAGATATTAG GGCCCAAGAC 
TATCAAGCCA GGGGAACCCT 
AACTTGGTCC GGGTCTTCTT 
CTGCACGTCC TGCTTGAAAA 
CGCAGATGGG CTTGATCATA 
AGAAGGGCGA TTCCGTAAAA 

405 



TCGTCACCGC 50 

GGTAATGAAG 100 

CATCATCCAG 150 

GTGTATCTAC 200 

TCCAGTATTT 250 

AATGGGTAGC 300 

AGAAACTCTG 350 

GGTGGAGACG 4 00 
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TAATTGGTGT TGTACAGGAA ACCGTGAATA TTGAACATAA GATC^TGGTG " * ~45fo 

CCTCCTGATA TCTCAGGTAC AATTTCCGAC ATAAAGAGCG GAAACTTTAC 5^0 

GGTAGTAGAC ACAATCTGTA CTCTGACTGA TGGGACCGAA TTGCAGATGA 550 

TGCAGAGGTG GCCTGTTCGA AGACCCAGAC CTGTGAAGGC AAAACTTACT 600 

5 CCAACCAGGC CTCTGGTTAC AGGAATGAGA ATCCTTGATG GGCTTTTCCC 650 

TGTGGCAAAA GGCGGAACAG CTGCAATCCC CGGACCTTTC GGATCGGGAA 700 

AGACCGTAAC TCAGCAGTCG CTTGCAAAAT GGAGTGATAC CGAAATTGTG 750 

GTCTACATCG GTTGTGGTGA GCGTGGAAAC GAAATGGCAG ATGTTCTGAG 800 

CGAATTCCCT GAACTCGAAG ATCCGCAGAC CGGGCGCCCA CTTATGGAGC 850 

10 GTACTGTTCT TATCGCTAAC ACTTCAAACA TGCCTGTGGC CGCAAGAGAA 900 

GCATCTGTGT ATACCGGAAT CACCATTGCA GAATACTACC GTGACATGGG 950 

ATTAGATGTA TCCCTTATGG CAGACTCCAC CTCAAGGTGG GCAGAAGCCA 1000 

TGAGAGAAAT CTCTTCCCGT CTGGAAGAAA TGCCTGGTGA AGAAGGTTAC 1050 

CCAGCATACC TGTCTGCAAG ACTGGCCGAA TTCTACGAGC GTGCCGGGGT 1100 

15 TGCGGAGAGT CTTTGCGGCG AAACAGGTTC CATTACTGTT ATTGGAGCAG 1150 

TATCTCCACC TGGCGGTGAC TTCTCAGAGC CTGTTACACA GAATACCCTG 1200 

CGTATCGTAA AAGTGTTCTG GGCTCTCGAT GCCAAACTAT CTCAGAGGCG 1250 

TCACTTCCCG GCCATCAACT GGCTGAACAG TTACAGTCTG TATAAGGACA 1300 

GTCTTAATGA CTGGTTTGCA GATAATGTGG CTCCTGATTA TGTGCCTTTG 1350 

20 AGGGAAAGAG CAATGGAAAT GCTCCAGACA GAATCTGAAC TGCAGGAAAT 1400 

CGTGCAGCTT GTAGGTTCCG ATGCTCTGCC AGACGACCAG CAGCTTCTGC 1450 

TTGAAATCAC CCGTATGCTT AGGGAAATTT TCCTGCAGCA GAATGCATTC 1500 

CACCCAGTAG ATGCATACAG CCCGTTCGAT CAGCAGTACA AGATCCTTAA 1550 

GGCAATCATG AAATGGGGAG ACGCTGCGAT GGATGCCTTG AAATCAGGTG 1600 

25 TTCCCGTAAC TGAAATTATC AAGCTTGAAT CCAAAAATGT GCTTGCTAAG 1650 

GTCAAGTACG AAGAGAAGTT TGATGAGTCT ATGAATGCTG TCCTGGCACA 1700 

GATGGATAAA GAGTTTGCAT CCCTGAGAGG TAGGTAA 1737 



30 



40 



2) INFORMATION. FOR SEQ ID NO: 714 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1785 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Methanosarcina jannaschii 

(C) ACCESSION NUMBER: extracted from U67477 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 714 

ATGAAAAGAG AGGTTGAGAA TATGCCAGTT GTTGGTAAGA TTATTAAAAT 50 

CGCAGGGCCT GTTGTAGTTG CAGAGGGAAT GAAAGGAGCT CAGATGTATG 100 

AGGTCGTTAA AGTAGGAGAA GAGAAATTGA CTGGAGAAAT CATTCAGTTG 150 

50 CACGATGATA AAGCAGTTAT TCAGGTTTAT GAAGAAACAT CTGGAATTAA 200 

ACCAGGAGAG CCAGTTGTTG GTACTGGAGC TCCATTGTCT GTTGAATTAG 250 

GGCCAGGGAT GTTAAGAGCT ATGTATGATG GTATTCAGAG GCCTTTAACA 3 00 

GCAATTGAAG AGAAAACAGG TTCAATCTTT ATCCCAAGAG GAGTTGATGT 350 

CCCTGCATTA CCAAGAGATA TAAAATGGGA ATTTAAACCA GTGGTAAATG 400 

55 AAGGAGATTA TGTTGAAGAA ■ GGAGACATAA TTGGAACTGT TGATGAAACT 450 

CCTTCAATAG TTCATAAAAT CTTAGTTCCA ATTGGTGTTA AAGGAAAAAT 500 

TGTTGAAATA AAAGAGGGTA AATTTACAGT TGAAGAGACA GTTGCAGTTG 550 

TAGAAACAGA AAATGGAGAA AGGAAAGAAA TTACAATGAT GCAAAAATGG 600 

CCAGTAAGAA AACCAAGACC ATATAAAGAG AAACTACCTC CAGAAATTCC 650 

60 ATTAATTACA GGGCAAAGAG TTGAAGACAC TTTCTTTACA TTAGCAAAAG 700 

406 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

GAGGAACAGC AGCAATTCCA GGTCCATTCG GTTCAGGAAA AACCSGTTACT "750 

CAGCATCAGT TGGCAAAGTG GTCTGACGCT GATGTCGTTG TTTATATCGG 800 

ATGTGGAGAA AGAGGAAACG AGATGACAGA GGTTATTGAA GAGTTCCCAC 850 

ACTTAGAAGA TATTAGAACT GGAAACAAAT TAATGGATAG AACTGTATTA 900 

5 ATAGCCAACA CATCAAACAT GCCTGTCGCT GCAAGGGAAG CATCTGTCTA 950 

TACAGGAATT ACAATTGCAG AGTACTTCAG AGATATGGGT TATGGAGTTT 1000 

TATTAACAGC AGATTCAACA TCAAGATGGG CAGAGGCAAT GAGAGAAATT 1050 

TCAGGTAGAT TGGAAGAAAT GCCAGGGGAA GAAGGGTATC CAGCATACTT 1100 

AGCTTCAAGA TTGGCTCAGT TCTATGAAAG AGCTGGAAGA GTTATAACCT 1150 

10 TAGGGAAAGA TAACAGACAA GGATTCGTTT GTATCGTTGG AGCTGTTTCA 12 00 

CCACCAGGAG GGGACTTCTC AGAACCAGTT ACATCAAACA CACTAAGGAT 1250 

AGTTAAGGTA TTCTGGGCGT TAGATGCAAA CTTGGCAAGA AGAAGACACT 1300 

TCCCAGCTAT CAACTGGTTG CAGAGTTATT CATTATACAT TGATGATGTT 1350 

ACAGAGTGGT GGAACACAAA TACTGGTCCA GATTGGAGAC AATTAAGAGA 14 00 

15 TGAAGCAATG AGCTTATTAC AAAAAGAGGC AGAGTTGCAA GAGATTGTTC 14 50 

AGTTAGTTGG GCCTGATGCA TTGCCAGATA GGGAGAGAGT TATTTTAGAA 1500 

GTTGCAAGAA TGTTGAGGGA GGATTTCTTA CAGCAAGATG CGTTTGATGA 1550 

GGTAGATACC TACTGTCCTC CAATGAAACA GTACTTAATG TTAAAGATAA 1600 

TTATGACATT CTACCAAGAA GCATTGAAGG CAGTTGAAAG AGGAGTTGAA 1650 

20 CCAGCTAAGA TTTTAGGAGT TTCAGTTAAG CAAGATATTG CAAGAATGAA 1700 

ATACATCCCA CACGATGAGT TTATAAATGT TAAATCAAAA GAAATAATGG 1750 

AGAAAATTAA GAATGAATTA GGTTCATTAA ACTAA 1785 



25 



2) INFORMATION FOR SEQ ID NO: 715 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1354 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) : TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

(B) STRAIN: W83 

40' (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 715 

TGAGCGAGGT GATCAAGGTG ATCGGCAAAA ATGCTTATGT GCAGGTTTTC 50 

GAAAGTACTC GCGGTATGCA CGTAGGAGAT GAGGCAGAGT TTACCGGCAG 100 

TATGCTTGAG GTAACGCTCG GCCCCGGTAT GCTTTCGAAG AACTACGACG 150 

45 GTCTGCAACA CGACTTGGAC AAGATGGACG GGATCTTCCT CAAACGAGGC 2 00 

GATTATACTC CCGCTCTCGA TGACGACAAG CTGTGGGACT TCAAGCCTTT 250 

GGCCAATGTG AACGACAATG TGATCGCAGG CTCATGGCTC GGAGAGGTGA 3 00 

CGGAAAATTT CCAACCGCAC AAGATCATGG TACCTTTCGT TTTCGAAGGC 350 

AATTACAAGG TGAAGAGTCT GGCCAAAGCC GGTTCGTACA AAGTGAACGA 400 

50 TGTGATCGCT GTGGTAACGG ATCAGGACGG GAAAGACCAC AATGTAACCA 450 

TGGTGCAGAA ATGGCCGGTG AAACGTGCTA TCACTTGCTA TCGCGAGAAG 500 

CCGCGTCCTT TCAAACTGCT CGAAACGGGT ATCCGTATCA TCGACACTTT 550 

CAACCCCATC GTAGAGGGTG GTACGGGATT TATCCCCGGT CCTTTCGGTA 600 

CGGGAAAGAC GGTGCTCCAG CATGCTATCT CGAAGCAGGC GGAAGCCGAT 650 

55 ATCGTGATCA TTGCAGCCTG TGGCGAGCGT GCAAACGAGG TTGTGGAGAT 700 

CTTTGCGGAA TTCCCCCACC TGAATGACCC CCACACGGGA CGTAAATTGA 750 

TGGAACGTAC CATTATTATT GCTAATACGT CGAATATGCC TGTGGCTTCG 800 

CGTGAGGCAT CCGTATATAC GGCCATGACG ATAGCCGAGT ACTATCGCTC 850 

CATGGGCCTT CGCGTGCTGA TGATGGCAGA CTCCACTTCG CGTTGGGCAC 900 

60 AGGCTCTGCG TGAGATGTCT AACCGTCTGG AAGAGCTTCC CGGACCGGAT 950 

407 
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GCTTTCCCGA TGGACTTGTC AGCTATCGTA 
AGGATACGTT TACCTGAACA ACGGTTCGGC 
GTACGGTATC TCCCGCCGGT GGTAACCTCA 
ACCAAGAAAG TGGCTCGCTG CTTCTATGCT 
5 CCGCAAACGT TATCCGGCTG TAAAGCCCAT 
TCGAATATCC CGAATTCGAG AGCTATATAT 
ATTATATTCA ACAGAAGTTG GCAGATATAT 
CTGGACTACT AAGGTGAATG AGCTGAAGAT 
AAAT 



PCT/CA00/01150 

GCCAACTTCT ACGbTCGTGC L0OO 

CGGTTCGGTA ACGTTCATCG 1050 

AAGAGCCTGT GACGGAAAAC 1100 

TTGGAGCAGA ATCGTGCCGA 1150 

CGATAGTTAC TCGAAGTACA 1200 

CGAACCACAT CAGTTTACTC 1250 

GGATCAAATC TTACAGCAGA 1300 

GCGCTTGCAT CAGGGTAAAG 1350 

1354 



2 ) INFORMATION FOR SEQ ID NO: 716 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1788 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
25 (B) STRAIN: Type 4 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 716 

TTTGACTCAA GGGAAGATTA TAAAAGTATC GGGACCTCTA GTTATTGCAT 50 

30 CAGGTATGCA GGAGGCTAAT ATTCAAGATA TTTGCCGTGT AGGTAAGCTA 100 

GGGTTAATCG GTGAAATTAT TGAAATGAGA AGAGATCAGG CATCTATCCA 150 

AGTCTATGAA GAAACATCTG GTCTTGGTCC GGGAGAACCT GTTGTTACAA 200 

CTGGAGAACC TCTCTCGGTT GAATTAGGGC CAGGATTGAT TTCTCAAATG 250 

TTTGATGGCA TACAACGCCC ATTAGATCGA TTTAAATTGG CTACTCATAA 30 0 

35 TGATTTTCTA GTTCGTGGGG TAGAAGTTCC AAGTTTGGAT AGAGATATTA 350 

AGTGGCATTT TGATTCCACT ATAGCAATTG GTCAAAAAGT GAGTACGGGT 400 

GATATTCTTG GAACTGTCAA GGAAACCGAG GTAGTTAATC ATAAAATTAT 450 

GGTTCCTTAT GGAGTATCTG GAGAAGTCGT TTCTATTGCA TCTGGCGATT 500 

TTACAATTGA TGAAGTTGTA TATGAAATAA AAAAATTGGA CGGTAGTTTC 550 

4 0 TATAAAGGAA CGCTTATGCA AAAATGGCCT GTCCGCAAGG CGCGTCCTGT 600 

TTCTAAACGT TTAATTCCAG AAGAACCATT AATCACAGGT CAACGAGTTA 650 

TTGATGCATT CTTTCCAGTA ACCAAAGGGG GAGCTGCAGC AGTTCCTGGA 700 

CCGTTTGGAG CAGGAAAGAC AGTTGTACAA CACCAAGTAG CTAAATTTGC 750 

CAATGTTGAT ATTGTTATTT ATGTCGGTTG TGGAGAACGT GGAAATGAAA 800 

45 TGACGGATGT ACTGAATGAG TTTCCTGAGT TGATTGACCC TAATACCGGA 850 

CAATCAATTA TGCAACGGAC AGTTCTGATT GCTAATACTT CAAATATGCC 900 

TGTTGCTGCT CGTGAGGCTT CAATTTATAC AGGAATTACC ATGGCTGAGT 950 

ATTTTCGTGA TATGGGCTAC TCTGTCGCCA TTATGGCTGA TTCAACTTCA 1000 

CGTTGGGCAG AAGCGCTACG TGAAATGTCA GGACGTCTAG AAGAAATGCC 1050 

50 TGGTGATGAG GGTTATCCTG CTTATCTGGG AAGTCGTATC GCTGAATATT 1100 

ATGAAAGAGC AGGACGTTCT CAGGTTCTAG GGCTTCCAGA ACGTGAAGGA 1150 

ACGATTACTG CTATTGGAGC TGTATCGCCA CCTGGTGGAG ATATTTCAGA 1200 

ACCAGTTACT CAAAACACTT TACGGATTGT GAAAGTTTTT TGGGGGCTTG 1250 

ATGCTCCGTT GGCACAGCGA CGTCATTTTC CTGCAATTAA CTGGCTTACA 1300 

55 TCTTATTCAC TATATAAAGA CAGTGTGGGC ACTTATATAG ATGGTAAAGA 1350 

GAAGACAGAT TGGAATAGTA AAATAACTCG TGCGATGAAC TACTTACAAC 1400 

GGGAATCTAG TTTAGAGGAA ATTGTTCGTC TTGTTGGAAT TGATTCTCTG 1450 

TCTGATAATG AACGACTAAC GATGGAAATT GCTAAACAAA TTCGAGAAGA 1500 

TTATTTGCAA CAGAACGCTT TTGATTCGGT AGATACATTC ACTTCGTTTG 1550 

60 CAAAACAAGA AGCAATGCTA AGTAATATTC TCACTTTTGC TGATCAGGCA 1600 

408 
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AATCATGCTT TAGAGTTGGG TTCTTACTTT 
CGTGGCAGTT CGAGACCGTA TGGCGAGAAG 
GATTAGATGA AATCAAAATT ATATCAAATG 
TTGATATTAG AAACAGGAGG TCTATAAATG 



PCT/CA00/01150 

ACAGAGATTA TGGSUIGGTAC 1450 

TAAATATGTT TCAGAAGATA 1700 

AGATTACACA TCAAATTCAT 1750 

AGTGTTAT 1788 



2) INFORMATION FOR SEQ ID NO: 717 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Burkholderia mallei 
20 (B) STRAIN: GB8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 717 

CGATCCTGGT GTGCTCGGCC GCTGACGGCC CGATGCCGCA AACGCGTGAG 50 

25 CACATCCTGC TGGCGCGTCA GGTCGGTGTG CCGTACATCA TCGTGTTCCT 100 

GAACAAGTGC GACATGGTGG ACGACGCGGA GCTGCTCGAG CTGGTCGAAA 150 

TGGAAGTGCG CGAACTGCTG TCGAAGTACG ACTTCCCGGG CGACGACACG 200 

CCGATCATCA AGGGTTCGGC GAAGCTGGCG CTGGAAGGCG ACAAGGGCGA 250 

GCTGGGCGAA GTGGCGATCA TGAACCTGGC CGACGCGCTG GACACGTACA 300 

30 TCCCGACGCC GGAGCGTGCG GTCGACGGCG CGTTCCTGAT GCCGGTGGAA 350 

GACGTGTTCT CGATCTCGGG CCGTGGTACG GTGGTGACGG GTCGTGTCGA 400 

GCGCGGCGTG ATCAAGGTTG GCGAGGAAAT CGAAATCGTC GGTATCAAGG 450 

CGACGGCGAA GACGACCTGC ACGGGCGTGG AAATGTTCCG CAAGCTGCTG 500 

GACCAGGGTC AGGCGGGCGA CAACGTCGGT ATCCTGCTGC GCGGCACAAA 550 

35 GCGTGAAGAC GTGGAGCGCG GCCAGGTTCT GGCGAAGCCG GGTTCGATCA 600 

CGCCGCACAC GCACTTCACG GCAGAAGTGT ACGTGCTGAG CAAGGACGAA 650 

GGCGGCCGCC ACACGCCGTT CTTCAACAAC TACCGTCCGC AGTTCTACTT 700 

CCGTACGACG GACGTGACGG GCTCGATCGA GCTGCCGAAG GACAAGGAAA 750 

TGGTGATGCC GGGCGACAAC GTGTCGATCA CGGTGAAGCT GATCGCGCCG 800 

40 ATCGCGATGG AAGAAGGTCT GCG 823 



2) INFORMATION FOR SEQ ID NO: 718 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 
CO STRANDEDNESS: Double 

50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Burkholderia pseudomallei 

(B) STRAIN: 1026B 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 718 

60 GCGATCCTGG TGTGCTCGGC CGCTGACGGC CCGATGCCGC AAACGCGTGA 50 

409 
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GCACATCCTG CTGGCGCGTC AGGTCGGTGT 
TGAACAAGTG CGACATGGTG GACGACGCGG 
ATGGAAGTGC GCGAACTGCT GTCGAAGTAC 
GCCGATCATC AAGGGTTCGG CGAAGCTGGC 
5 AGCTGGGCGA AGTGGCGATC ATGAACCTGG 
ATCCCGACGC CGGAGCGTGC GGTCGATGGC 
AGACGTGTTC TCGATCTCGG GCCGTGGTAC 
AGCGCGGCGT GATCAAGGTT GGCGAGGAAA 
GCGACGGCGA AGACGACCTG CACGGGCGTG 

10 GGATCAGGGT CAGGCGGGCG ACAACGTCGG 
AGCGTGAAGA CGTGGAGCGC GGCCAGGTTC 
ACGCCGCACA CGCACTTCAC GGCTGAAGTG 
AGGCGGCCGC CACACGCCGT TCTTCAACAA 
TCCGTACGAC GGACGTGACG GGCTCGATCG 

15 ATGGTGATGC CGGGCGACAA CGTGTCGATC 
GATCGCGATG GAAGAAGGTC TGCG 



PCT/CA00/01150 

GCCGTACATC AT<SGTGTTCC /LOO 

AGCTGCTCGA GCTGGTCGAA 150 

GACTTCCCGG GCGACGACAC 200 

GCTGGAAGGC GACAAGGGCG 250 

CCGACGCGCT GGACACGTAC 300 

GCGTTCCTGA TGCCGGTGGA 350 

GGTGGTGACG GGTCGTGTCG 400 

TCGAAATCGT CGGTATCAAG 450 

GAAATGTTCC GCAAGCTGCT 500 

TATCCTGCTG CGCGGCACGA 550 

TGGCGAAGCC GGGTTCGATC 600 

TACGTGCTGA GCAAGGACGA 650 

CTACCGTCCG CAGTTCTACT 700 

AGCTGCCGAA GGACAAGGAA 750 

ACGGTGAAGC TGATCGCGCC 800 

824 



20 2) INFORMATION FOR SEQ ID NO: 719 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 800 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium beijerincki (deposited as 
Clostridium butyricum) 

(B) STRAIN: ATCC 8260 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 719 

TGTATCAGCA GCAGATGGTC CAATGCCACA AACAAGAGAA CATATACTAC 50 

TAGGATCAAG AGTTGGTATC CAATATATCG TAGTATTCTT AAATAAAGCA 100 

GATATGGTAG ACGATCCAGA ATTATTAGAA TTAGTAGAAA TGGAAGTAAG 150 

40 AGAATTATTA AGCGAATATG ACTTCCCAGG AGACGATATT CCAGTAATAA 200 

CAGGATCAGC ATTAAAAGCA TTAGAAAATC CAACAGATGA AGAAGCAATT 250 

AAGCCAATCA TGGATTTAAT GGAAGCAGTA GATAGCTATA TCCCAACTCC 300 

AGAAAGAGCA ACAGATAAGC CATTCTTAAT GCCAATCGAA GATGTATTCA 350 

CAATTACAGG AAGAGGAACA GTTGCAACAG GAAGAGTTGA AGCTGGAGTA 400 

45 CTTCATGTAG GAGATGAAGT AGAAATCGTT GGATTAACAG AAGAAAAGAA 450 

GAAAGTTGTA GTAACTGGAA TCGAAATGTT CAGAAAGTTA TTGGATGAAG 500 

CACAAGCTGG AGATAACATC GGAGCATTAT TAAGAGGAGT TCAAAGAACT 550 

GATATTGAAA GAGGTCAAGT TTTATCAAAA CCAAATTCAG TACACCCTCA 600 

CACTAAATTT GTAGGTCAAG TATACGTACT TAAAAAAGAA GAAGGTGGAA 650 

50 GACATACTCC ATTCTTTGAT GGATACAGAC CACAATTCTA TTTCAGAACA 700 

ACAGACGTTA CAGGRTCAAT CAAGTTACCA GATGGAATGG AAATGGTAAT 750 

GCCTGGAGAT CACATTGATA TGAATGTTGA ATTAATCACT CCAATCGCAA 800 



55 

2 ) INFORMATION FOR SEQ ID NO: 720 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 799 bases 
60 (B) TYPE: Nucleic acid 

410 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium innocuum 

(B) STRAIN: ATCC 14501 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 720 

GGTGCTATCC TGGTTGTTGC TGCATCTGAT GGTCCTATGC CTCAGACTCG 50 

TGAGCACATC CTGCTTGCTC GTCAGGTAGG TGTTCCTTAC ATCGTTGTAT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAACTGAT CGACCTTGTT 150 

15 GAAATGGAAG TACGTGAGCT GTTAAGCGAG TACGGATTCG ACGGAGATAA 200 

CGCTCCGGTT ATCCGTGGTT CTGCACTGAA GGCTCTGGAA GGTGACGACA 250 

AATACGTTGG CGCTAT CAAA GAACTGATGG ATGCAGTTGA TGAATTCATC 300 

CCAGATCCAA CTCGTGAAAC TGACAAACCA TTCCTGATGT CTGTAGAAGA 350 

CGTTATGACA ATCACAGGAC GTGGTACAGT TGCTACAGGA CGTGTTGAGC 400 

20 GTGGGGTAGT AAAACTGGGA GAAGAAGTTG AAATCGTTGG TATCAAGGAT 450 

ACTCAGAAAA CTGTTGTTAC CGGACTGGAA ATGTTCCGTA AGCAGCTGGA 500 

CTTCGCAGAA TCCGGAGACA ACATCGGTGC TCTGCTGCGT GGTATCAACC 550 

GTGACCAGAT TCAGCGTGGA CAGGTTCTTG CTAAACCAGG ATCCGTACAT 600 

CCACACACAA AGTTCAAGGC TCAGGTTTAT GTATTAACAA AAGAAGAAGG 650 

25 TGGACGTCAC ACTCCATTCG TTTCTAACTA CCGTCCTCAG TTCTACTTCC 700 

GTACAACTGA CGTAACTGGT GTTATTACAT TACCGGAAGG AACTGAAATG 750 

GTTATGCCTG GTGACAACGT TGAAATGAAC GTTGAGC TGA TTGCTCCAA 799 



30 



40 



2) INFORMATION FOR SEQ ID NO: 721 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 789 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium novyi 

(B) STRAIN: ATCC 19402 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 721 

TTGTATCAGC AGCAGATGGT CCAATGCCAC AAACAAGAGA ACACATCCTA 50 

TTAGCATCAA GAGTAGGAGT TAACCACATA GTAGTATTCT TAAACAAAGC 100 

AGACCAAGTA GATGATCCAG AATTACTAGA ATTAGTAGAA ATGGAAGTAA 150 

50 GAGAATTATT AAGCGAATAC GGATTTGACG GAGACGAATG TCCAGTAGTA 200 

GTAGGATCAG CATTAAAAGC AATCGAAGAA GGGGATGACC AATGCATCCT 250 

AGACTTAATG AAAGCTGTAG ATGAATATAT CCCAACTCCA GAAAGAGCAA 300 

CAGATCAACC ATTCTTAATG CCTGTAGAAG ATGTATTTAC AATTACAGGA 350 

AGAGGAACAG TTGCAACAGG AAGAGTTGAA AGAGGAGTAC TACACGTAGG 4 00 

55 AGATGAAGTA CAAATCGTAG GAATGAAAGA AGAAATCGGA AAGACAACAA 450 

TCACAGGAGT AGAAATGTTC AGAAAGATGT TAGATGAAGC AATGGCTGGA 500 

GATAACATCG GAGCATTATT AAGAGGAGTA CAAAGAGACG AAATCGAAAG 550 

AGGTCAAGTA CTAGCAAAAC CAGGTTCAGT AACACCTCAC AAAAAATTCG 600 

■TAGGTCAAGT TTACGTATTA AAGAAAGAAG AAGGTGGAAG ACACACTCCA 650 

60 TTCTTTAACG GATACAGACC ACAATTCTAC TTCAGAACAA CAGACGTAAC 700 

411 
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AGGATCAATC GCTTTACCAG AAGGAGTAGA AATGGTAATG CCAGGAGACC 75JD 
ATATAGACAT GAACGTAGAA TTAATCACAC CAGTAGCAA 789 



2 ) INFORMATION FOR SEQ ID NO: 722 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 798 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium septicum 

(B) STRAIN: ATCC 124 64 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 722 

GTTCAGCAGC AGACGGTCCA ATGCCACAAA CAAGAGAACA TATACTACTA 50 

GCATCAAGAG TTGGTGTTGA CTATAT CGTA GTATTCTTAA ACAAGGCAGA 100 

TATGGTAGAT GACGAAGAAT TATTAGAATT AGTAGAAATG GAAGTTAGAG 150 

25 AATTATTATC AGAATACAAC TTCCCAGGAG ATGATATTCC AGTAATCAAG 2 00 

GGATCAGCTT TAGTAGCATT AGAAAACCCA ACAGATGAAA AATCAATCGC 250 

TCCAATCTTA GAATTAATGG AAGCAGTAGA TAGCTACATT CCAACACCAG 3 00 

AAAGAGCAAC AGATAAGCCA TTCTTAATGC CAGTAGAAGA TGTATTCACA 350 

ATAACTGGTA GAGGAACAGT TGCAACAGGA AGAGTTGAAA GAGGAGTTCT 400 

30 TCATGTAGGA GACGAAGTAG AAATCGTTGG ATTATCAGAA GAAAGCAGAA 450 

AAGTAGTAGT AACAGGAATA GAAATGTTCA GAAAGTTACT AGACGAAGCA 500 

CAAGCTGGAG ATAATGTTGG AGTACTTTTA AGAGGTGTTC AAAGAACAGA 550 

TATCGAAAGA * GGTCAAGTAT TAGCAAAGAC TGGATCAGTT AAGCCACACA 600 

GCAAGTTCGT AGGTCAAGTA TACGTACTTA AGAAAGAAGA AGGTGGAAGA 650 

35 CATACTCCAT TCTTCGATGG ATACAGACCA CAATTCTACT TCAGAACAAC 700 

AGACGTTACT GGATCAATCA AATTACCAGA CGGAATGGAA ATGGTTATGC 750 

CAGGAGACCA CATTGATATG AACGTTGAAT TAATCACTCA AGTAGCAA 798 



40 



50 



2) INFORMATION FOR SEQ ID NO: 723 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH; 799 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium tertium 

(B) STRAIN: ATCC 14573 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 723 

GTTCAGCAGC AGATGGTCCA ATGCCTCAAA CAAGAGAACA CATACTACTA 50 

GCTTCAAGAG TTGGTGTTGA CTACATAGTT GTTTTCTTAA ACAAGGCAGA 100 

TATGGTAGAT GACGAAGAAT TATTAGAATT AGTTGAAATG GAAGTAAGAG 150 

60 AATTATTATC AGAATACAAC TTCCCAGGAG ATGATATTCC AGTAATAAAG 200 

412 
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GGTTCAGCTT 
TCCAATCCTT 
AAAGAGCAAC 
ATCACTGGTA 
TCACGTAGGA 
AAGTAGTAGT 
CAAGCTGGAG 
CATCGAAAGA 
AGAAATTTGT 
CATACTCCAT 
AGAYGTTACT 
CAGGAGACCA 



TACAAGCATT 
GAGTTAATGG 
AGATAAGCCA 
GAGGAACAGT 
GACGAAGTAG 
AACAGGAATA 
ACAACGTAGG 
GGTCAAGTTT 
AGGTCAAGTA 
TCTTCGATGG 
GGTTCAATCA 
CATTGATATG 



AGAAAACCCA 
AAGCTGTAGA 
TTCTTAATGC 
TGCTACAGGA 
AAATCGTTGG 
GAAATGTTCA 
AGTTCTTTTA 
TAGCAAAAGT 
TACGTACTTA 
ATACAGACCA 
AGTTACCAGA 
AACGTTGAAT 



ACAGATGAAA ^AGCAATCGC 
TAGCTACATT CCAACTCCAG 
CAGTAGAAGA TGTATTCACA 
AGAGTTGAAA GAGGAGTTCT 
ATTATCAGAA GACAGCAGAA 
GAAAGTTACT AGACGAAGCG 
AGAGGAGTTC AAAGAACTGA 
TGGATCAGTT AAGCCACACA 
AAAAAGAAGA AGGTGGAAGA 
CAATTCTACT TCAGAACAAC 
TGGAATGGAA ATGGTTATGC 
TAATCACTCA AGTAGCTAT 



250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
799 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 724 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 801 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium tetani 

(B) STRAIN: ATCC 19406 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 724 

TAGTAAGTGC AGCAGATGGT. CCAATGCCAC AAACAAGAGA ACACATACTA 50 

TTAGCATCCA GAGTTGGAGT TGAGCACATA v GTAGTATTCT TAAATAAAGC 100 

AGACCAAGTA GATGACGCAG AGTTAATCGA ATTAGTAGAA ATGGAAGTAA 150 

GGGAATTAAT GAACGAATAC GGATTCCCAG GAGATGACGC ACCAGTAGTA 200 

GTAGGATCCG CATTAAAAGC ATTAGAAAAT CCAGAAGATG ATGCAGCAAC 250 

ACAATGCATA ATGGACTTAA TGGCAGCAGT AGATGAATAT ATACCAACAC 300 

CAGAAAGAGC AACAGATAAG CCATTCTTAA TGCCAGTAGA AGATATCTTC 350 

ACAATCACAG GAAGAGGAAC AGTTGCAACA GGAAGAGTAG AAAGAGGAAT 400 

TCTAAAAGTA GGAGACGAAA TAGAAATCGT AGGATTAAGT GATGAAAGCA 450 

AGAAATCAGT AATCACAGGA ATAGAAATGT TCAGAAAACT ATTAGATGAA 500 

GCACAAGCAG GAGATAACAT CGGAGCATTA TTAAGAGGTG TTCAAAGAGA 550 

TGAAATCCAA AGAGGTCAAG TATTAGCAGC AACAGGATCA GTAAAACCAC 600 

ATAAGAGTTT TACAGGTCAA GTATATGTAT TAAAGAAAGA AGAAGGAGGA 650 

AGACACACTC CATTCTTTAA CGGATACAGA CCACAATTCT ACTTTAGAAC 700 

AACAGACGTA ACAGGTTCAA TCGCACTACC AGAAGGAGTA GAAATGGTAA 750 

TGCCAGGAGA CCACATAGAC ATGAAGGTAG AATTAATAAC AAGAGTAGCA 800 

A 801 



2) INFORMATION FOR SEQ ID NO: 725 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 633 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



413 
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5 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus malodoratus 

(B) STRAIN: ATCC 43197 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 725 



TATGCCTCAA ACTCGTGAAC ACATCTTGTT ATCTCGTAAC GTTGGTGTTC 50 

CTTACATCGT TGTATTCTTA AACAAAATGG ATATGGTTGA TGATGAAGAA 100 

10 TTACTAGAAT TAGTTGAAAT GGAAGTTCGT GACTTATTGT CAGAATACGA 150 

CTTCCCAGGC GACGACACTC CAGTTATCGC TGGTTCAGCT TTGAAAGCTT 200 

TAGAAGGCGA TGCTTCATAC GAAGAAAAAA TCTTAGAATT AATGGCTGCT 250 

GTTGATGAAT ATATCCCAAC ACCAGTTCGT GATACTGACA AACCATTCAT 300 

GATGCCAGTC GAAGATGTAT TCTCAATCAC TGGTCGTGGA ACTGTTGCAA 350 

15 CTGGTCGTGT TGAACGTGGA CAAGTTCGCG TTGGTGACGA AGTTGAAATC 400 

GTTGGTATTG CTGAAGCAAC TGCTAAAACA ACTGTTACAG GTGTTGAAAT 450 

GTTCCGTAAA TTGTTAGATT ACGCTGAAGC AGGCGATAAC ATTGGTGCAT 500 

TGTTACGTGG TGTTGCACGT GAAGACATCC AACGTGGACA AGTATTGGCT 550 

AAACCAGCTT CAATCACTCC ACATACAAAA TTCTCTGCAG AAGTTTACGT 600 

20 TTTAACTAAA GAAGAAGGCG GACGTCATAC TCC 633 



2) INFORMATION FOR SEQ ID NO: 726 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 623 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Enterococcus sulfureus 

(B) STRAIN: ATCC 49903 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 726 

40 CACGTGAACA CATCTTGTTA TCTCGTAACG TAGGTGTTCC TTACATCGTT 50 

GTATTCTTAA ACAAAATGGA TATGGTTGAT GACGAAGAAT TATTAGAATT 100 

AGTAGAAATG GAAGTTCGTG ACTTATTATC AGAATACGAT TTCCCAGGCG 150 

ATGACACTCC AGTTGTTGCA GGTTCTGCTT TGAAAGCTTT AGAAGGCGAC 200 

GCTTCTTACG AAGAAAAAAT CATGGAATTA ATGGCTGCAG TTGACGAGTA 250 

45 CATCCCAACT CCAACTCGTG ACACTGACAA ACCATTCATG ATGCCAGTTG . 300 

AGGATGTATT CTCAATCACT GGACGTGGTA CTGTTGCTAC AGGTCGTGTT 350 

GAACGTGGAC AAGTTCGCGT TGGTGACGTT GTAGATATCG TTGGTATCGC 400 

TGACGAAACT GCTCAAACAA CTGTAACAGG TGTTGAAATG TTC CGTAAAT 450 

TATTAGACTA CGCTGAAGCA GGCGATAACA TCGGTGCTTT ATTACGTGGT 500 

50 GTTGCTCGTG AAGACATCCA ACGTGGACAA GTTTTAGCTA AACCAGCTTC 550 

AATCACTCCA CATACAAAAT TCTCTGCTGA AGTATACGTA TTAAGCAAAG 600 

AAGAAGGTGG ACGTCATACT CCA 623 



55 

2) INFORMATION FOR SEQ ID NO: 727 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 646 bases 
60 (B) TYPE: Nucleic acid 

414 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus garvieae 

(B) STRAIN: ATCC 49156 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 727 

CGGTCCTATG CCTCAAACTC GTGAACACAT CTTGTTATCA CGTAACGTTG 50 

GCGTACCATA CATCGTTGTT TTCTTGAACA AAATGGATAT GGTTGATGAC 100 

GAAGAATTGC TAGAATTAGT TGAAATGGAA GTTCGTGACC TATTGTCTGA 150 

15 ATATGACTTC CCAGGCGACG ATGTTCCTGT AATCGCTGGT TCTGCTTTGA 200 

AAGCTCTTGA AGGAGATCCT TCATACGAAG AAAAAATCAT GGAATTGATG 2 50 

GCTGCAGTTG ACGAATACGT TCCAACTCCA GAACGTGATA CTGACAAACC 3 00 

ATTCATGATG CCAGTCGAAG ACGTATTCTC AATCACTGGA CGTGGTACTG 350 

TTGCTACAGG CCGTGTTGAA CGTGGACAAG TTCGCGTTGG TGATGAAGTA 400 

20 GAAATCGTTG GTATTGCTGA CGAAACTGCT AAAACAACTG TAACAGGTGT 450 

TGAAATGTTC CGTAAATTGT TAGACTATGC TGAAGCAGGG GATAACATTG 500 

GTGCATTGCT ACGTGGGGTT GCTCGTGAAG ACATCCAACG TGGACAAGTA 550 

TTGGCTAAAG CTGGTACAAT CACACCTCAT ACAAAATTCA AAGCTGAAGT 600 

TTATGTTTTG ACAAAAGAAG AAGGTGGACG TCACACTCCA TTCTTC 646 



25 



40 



60 



2) INFORMATION FOR SEQ ID NO: 728 



(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycoplasma pirum 

(B) STRAIN: ATCC 25960D 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 728 



GAGCAATTTT AGTTGTTTCT GCAACTGATG GTCCAATGCC TCAAACTCGT 50 

GAACATATCT TATTAGCACG CCAAGTTGGT GTTCCTAAAA TGGTTGTTTT 100 

45 CTTAAACAAA TGTGATGTTG CTTCTGATCC AGAAATGCAA GAATTAGTTG 150 

CTGAAGAAGT AAAAGACTTA TTAAAATCTT ATGGTTTTGA TGGTGACAAT 2 00 

ACTCCAATTA TTCGTGGTTC AGCATTAGAA GCATTAAATG GTAAACCTGA 250 

ATGAGAAGAA AAAATTAAAG AATTAATGAA GGCAGTGGAT GACACTATTC 3 00 

CTGATCCAGT TCGTGATACT GAAAAGCCAT TCTTGTTACC AATTGAAGAC 350 

50 GTAATGACAA TTACAGGTCG TGGTACTGTT GTTACAGGTC GTGTAGAACG 400 

TGGTACTCTA AAATTAAATG ATGAAGTTGA AATTGTTGGT TTAGGTGAAA 450 

CATTTAAATC TGTTGTAACA GGTATTGAAA TGTTCCGTAA AGAATTAGAT 500 

GAAGCTCGTG CTGGTGACAA TGCTGGTATT TTACTTCGTG GTGTTGACCG 550 

TGGTCAAGTA CAACGTGGTC AAGTTCTTGC TAAACCAGGT TCTATTACTC 600 

55 CTCATACTAA ATTTAAAGCT GAAATTTATG CTTTGAAAAA AGAAGAAGGT 650 

GGTCGTCATA CTGCTTTCTT AAACGGTTAT CGTCCTCAAT TCTATTTCAG 700 

AACAACTGAT GTTACAGGTT CTATTAAATT AAAAGATGGA ACTGAAATGG 750 

TTATGCCTGG TGACAATACT GAAATCACTG TAGAATTAAT TTCACCAATT 800 

GCTTGTAAAA AGGAAGTAAG TTT 823 



415 
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2) INFORMATION FOR SEQ ID NO: 729 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycoplasma salivarium 
15 (B) STRAIN: ATCC 23064 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 729 

GGAGCAATCT TAGTTGTTGC TGCAACTGAT GGTGCGATGC CTCAAACTCG 50 

20 TGAACACGTT TTACTTGCAA AACAAGTTGG TGTTCCTAAA ATCGTTGTTT 100 

TCTTAAACAA AATCGATATG TTCAAGCCAG AAGAAAGAGC CGAAATGGTT 150 

GAAATGGTTG AAATGGACAT TCGTGACTTA TTAAACAAAT ACGACTTTGA 200 

TGGTGACAAT ACTCCTATTA TTGCTGGTTC AGCATTAAAA GCATTACAAG 250 

GTGATCCAGA ATATGAAAAG AATATTCTAG AACTTATGGA TGCAGTAGAC 300 

25 ACATACATTG ATGAACCTAC TCGTGATAAA GATAAACCAT TCTTAATGGC 350 

TGTAGAAGAC GTATTTACAA TTACTGGTCG TGGAACTGTT GCTACTGGTA 400 

GAGTAGAACG TGGTACATTA CATCTAAATG ATGAAGTTGA AATCGTTGGT 450 

CTACGTCCAA CAATTAAAAC TGTTGTTACT GGAATTGAAA TGTTCCGTAA 500 

AAATTTAAAA GAAGCTCAAG CTGGAGATAA TGCAGGACTT TTACTACGTG 550 

30 GAATTGATAG AGATCAAGTA GAACGTGGAC AAGTTTTGGC CAAACCAAAA 600 

AGTATTATTC CTCACACAGA ATTTGAAGCT * GCTGTGTATG TTCTAAAAGC 650 

TGAAGAAGGT GGACGTCACA CTCCATTCTT TGAACACTAT AAACCACAAT 700 

TTTACTTTCG TACAACCGAC GTTACTGGTG GAATTAAATT CAAACCTGGA 750 

CGTGAAATGG TTATGCCTGG CGAAAATGTT GAATTTACAG TTACTTTAAT 800 

35 TGCTCCTATT GCAGTTGAAG AAGGAA 826 



2) INFORMATION FOR SEQ ID NO: 73 0 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Neisseria polysaccharea 

(B) STRAIN: ATCC 43768 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 730 

55 TGGTATGTTC CGCAGCYGAY GGYCCTATGC CTCAAACTCG CGAACACATC 50 

CTGYTGGCTC GCCAAGTAGG YGTACCTTAC ATCATCGTRT TCATGAACAA 100 

ATGCGACATG GTTGACGATG CCGAGCTGTT GGAACTGGTT GAAATGGAAA 150 

TCCGYGACCT GYTGTCMAGC TACGACTTCC CMGGCGACGA CTGCCCAATC 200 

GTACAAGGTT CTGCACTGAA AGCTTTGGAA GGYGAYGCTG GTTACGAAGA 250 

60 GAAAATCTTC GAAYTGGCTG CTGCTTTGGA CAGCTACATC CCAACTCCTG 300 
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AGCGTGCTGT 
ATCTCTGGYC 
CCACGTTGGT 
CCACTTGTAC 
GCWGGTGACA 
AGAGCGTGGT 
AATTCAAAGC 
ACTCCATTCT 
CGTAACYGGT 
GTGAGAACGT 
AAGGTCTGCG 



GGACAAACCK 
GYGGTACMGT 
GACGAGATYG 
CGGCGTTGAA 
ACGTAGGCGT 
CAAGTATTGG 
AGAAGTATAC 
TCGCYAACTA 
GCAGTTACTT 
AACCATTACY 



TTCYTGYTGC 
AGTAACYGGY 
AAATCGTAGG 
ATGTTCCGCA 
ATTGYTGCGT 
CYAAACCAGG 
GTACTGAGCA 
CCGYCCMCAA 
TGGAAGAAGG 
GTAGAACTGA 



CTATCGAAGA CGTRTTbTCT 
CGTGTAGAGC GCGGTRTCAT 
TCTGAAAGAA ACCCAAAAAA 
AACTGCTGGA CGAAGGTCAA 
GGTACCAARC GTGAAGACGT 
YACCATYACT CCSCACACCA 
AAGAAGAGGG TGGTCGYCAC 
TTCTACTTCC GTACYACTGA 
TGTAGAAATG GTAATGCCAG 
TTGCGCCTAT CGCTAGGAAG 



350 
400 
450 
500 
550 
600 
650 
70Q 
750 
800 
810 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 731 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Enteritidis 

(B) STRAIN: ATCC 13076 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 731 

CTGGTTGTTG CTGCGACTGA • CGGCCCGATG CCGCAGACCC GTGAGCACAT 50 

CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG TTCCTGAACA 100 

AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT TGAAATGGAA 150 

GTTCGYGAAC TGCTGTCTCA GTACGACTTC CCGGGCGACG ACACTCCGAT 200 

CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA GAGTGGGAAG 250 

CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT CCCGGAACCA 300 

GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG ACGTATTCTC 350 

CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA CGCGGTATCA 400 

TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA GACTCAGAAG 450 

TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG ACGAAGGCCG 500 

TGCCGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA CGTGAAGAAA 550 

TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA GCCGCACACC 600 

AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG GCGGCCGTCA 650 

CACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC CGTACTACTG 700 
ACGTGACTGG CACCAT CGAA CTGCCGGAAG GCGTAGAGAT GGTAATGCCG . 750 

GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA TCGCAATGGA 800 

CGACGGTCTG CGT 813 



2) INFORMATION FOR SEQ ID NO: 732 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Gallinarum 
5 (B) STRAIN: ATCC 9184 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 732 

CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC GTGAGCACAT 50 

10 CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG TTCCTGAACA 100 

AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT TGAAATGGAA 150 

GTTCGYGAAC TGCTGTCTCA GTACGACTTC CCGGGCGACG ACACTCCGAT 2 00 

CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA GAGTGGGAAG 250 

CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT CCCGGAACCA 300 

15 GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG ACGTATTCTC 350 

CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA CGCGGTATCA 400 

TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA GACTCAGAAG 450 

TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG ACGAAGGCCG 500 

TGCCGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA CGTGAAGAAA 550 

20 TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA GCCGCACACC 600 

AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG GCGGCCGTCA 650 

CACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC CGTACTACTG 700 

ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT GGTAATGCCG 750 

GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA TCGCAATGGA 800 

25 CGACGGTCTG CG 812 



30 



45 



2) INFORMATION FOR SEQ ID NO: 733 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814- bases 

(B) TYPE: Nucleic acid : ' 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 

serotype Paratyphi B 
(B) STRAIN: ATCC 8759 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 733 



TGGTTGTTGC TGCGACTGAC GGYCCGATGC CGCAGACCCG TGAGCACATC 50 

CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT TCCTGAACAA 100 

ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT GAAATGGAAG 150 

TTCGYGAACT GCTGTCTCAG TACGACTTCC CGGGCGACGA CACTCCGATC 200 

50 GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG AGTGGGAAGC 250 

GAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTACATT CCGGAACCAG 3 00 

AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA CGTATTCTCC 350 

ATCTCTGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC GCGGTATCAT 400 

CAAAGTGGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAG ACTCAGAAGT 4 50 

55 CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA CGAAGGCCGT 500 

GCYGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC GTGAAGAAAT 550 

CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAG CCGCACACCA 600 

AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG CGGCCGTCAT 650 

ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC GTACTACTGA 700 

60 CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG GTAATGCCGG 750 
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GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT CGCAATGGAC 80tf 
GACGGTCTGC GTTT 814 



2) INFORMATION FOR SEQ ID NO: 734 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 828 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Virchow 

(B) STRAIN: ATCC 51955 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 734 



GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

25 TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150 

GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGACTTCC CGGGCGACGA 200 

CACTCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG 250 

AGTGGGAAGC GAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTACATT 3 00 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 3 50 

30 CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC 4 00 

GCGGTATCAT CAAAGTGGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAG 450 

ACTCAGAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCCGGTGAGA ACGTAGGTGT- TCTGCTGCGT GGTATCAAAC 550 
GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CAC C ATCAAG . 600 

35 CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTACTACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800 

CGCAATGGAC GACGGTCTGC GTTTCGCA 828 

40 



2) INFORMATION FOR SEQ ID NO: 735 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia grimesii 
55 (B) STRAIN: ATCC 14460 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 735 

GGCGCTATCC TGGTTGTTGC TGCGACTGAT GGCCCAATGC CACAGACCCG 50 
60 TGAGCACATC CTGCTGGGTC GTCAGGTTGG CGTTCCTTTC ATCATCGTAT 100 

419 
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TCATGAACAA ATGCGACATG GTTGATGATG 
GAAATGGAAG TTCGTGAACT TCTGTCTGCT 
CCTGCCAGTT GTTCGTGGTT CAGCGCTGAA 
AGTGGGAAGC TAAAATCATC GAACTGGCTG 
5 CCAGAACCAG AGCGTGCTAT CGACAAGCCG 
CGTATTCTCC ATCTCCGGYC GTGGTACYGT 
GCGGTATCGT TAAAGTTGGC GAAGAAGTTG 
ACCGTTAAGT CTACCTGTAC TGGCGTTGAA 
CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT 

10 GTGAAGACAT CGAACGTGGT CAGGTACTGG 
CCACACACCA AATTCGACTC AGAAGTTTAC 
TGGTCGTCAC ACTCCATTCT TCAAAGGCTA 
GTACAACTGA CGTGACCGGT ACCATCGAAC 
GTAATGCCAG GCGATAACGT GAACATGGTT 

15 CGCGATGGAC GACGGTCTGC GTTTC 



PCT/CAOO/01150 

AAGAGCTGCT GGAACTGGTA f50, 

TATGACTTCC CTGGTGATGA 200 

AGCACTGGAA GGCGAAGCTG 250 

GCTACCTGGA TTCTTACATC 300 

TTCCTGCTGC CAATCGAAGA 350 

AGTTACCGGT CGTGTAGAGC 400 

AAATCGTTGG TATCAAAGAC 450 

ATGTTCCGCA AACTGCTGGA 500 

TCTGCTGCGT GGTATCAAGC 550 

CTAAACCAGG TTCAATCAAG 600 

ATCCTGAGCA AAGAAGAAGG 650 

CCGTCCACAG TTCTACTTCC 700 

TGCCAGAAGG CGTAGAGATG 750 

GTAACCCTGA TTCACCCAAT 800 

825 



2) INFORMATION FOR SEQ ID NO: 736 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Clostridium difficile 

(B) STRAIN: ATCC 9689 

(xi) SEQUENCE DESCRIPTION: SEQ ID. NO: 736 

35 TATTAGTTTG TTCAGCAACA GATGGACCAA TGC CACAAAC AAGAGAGCAT 50 

ATACTATTAT CAAGACAAGT TGGAGTACCA TATATAGTAG TATTCTTAAA 100 

CAAATGTGAC ATGGTAGATG ATGAAGAGTT ATTAGAGTTA GTAGAGATGG 150 

AAGTAAGAGA TTTATTAACA GAATATGATT TCCCAGGAGA TGACACTCCA 200 

ATAGTAAGAG GTTCAGCATT AATGGCATTA GAAGATCCAA AGAGYGAGTG 250 

40 GGGAGATAAG ATAGTAGAAT TATTCGAGCA AATAGATGAG TATATAC C AG 300 

CTCCAGAGAG AGATACAGAT AAACCATTCT TAATGCCAGT AGAGGACGTA 350 

TTCTCAATCA CAGGAAGAGG AACAGTTGCA ACAGGAAGAG TGGAAAGAGG 400 

AGTACTAAAA GTACAAGACG AAGTAGAKTT AGTAGGATTA ACAGAAGCAC 450 

CAAGAAAAGT AGTAGTAACA GGAGTAGAGA TGTTCAGAAA ATTATTAGAC 500 

45 CAAGCACAAG CAGGGGATAA TATAGGAGCA TTATTAAGAG GAGTACAAAG 550 

AAACGAGATA GAAAGAGGAC AAGTACTAGC AAAGACTGGA TCAGTAAAGG 600 

CACACACAAA GTTTACAGCA GAAGTATATG TACTTAAAAA AGAAGARGGT 650 

GGAAGACATA CACCATTCTT TGATGGATAT AGACCACAAT TCTATTTCAG 700 

AACAACAGAC GTAACAGGAG CTTGTAAGTT ACCAGAAGGA ATAGAGATGG 750 

50 TAATGCCTGG AGATAACGTA ACAATGGAAG TAGACTTAAT AAACTCAA 798 



2) INFORMATION FOR SEQ ID NO: 737 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Burkholderia pseudomallei 

(B) STRAIN: 1026B 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 7 



GTCAACATGA TGGAGCTCAT 
CTCCGTGTTC GCGGGCGTGG 
ACCACGAAAT GAAGGACTCG 
GGCCAGATGA ACGAGCCGCC 
CCTCACGATG GCCGAGCACT 
TCGTCGACAA CATCTACCGT 
CTGCTCGGCC GTATGCCGTC 
AGAAATGGGC AAGCTGCAAG 
TCACGTCGGT T 



CAACAACATC GCGAAGGAGC 
GCGAGCGTAC CCGTGAAGGG 
AACGTTCTCG ACAAGGTCGC 
GGGCAACCGT CTGCGCGTGG 
TCCGTGACGA AGGCCTCGAC 
TTCACGCTGG CCGGTACCGA 
GGCAGTGGGC TATCAGCCGA 
AGCGCATCAC GTCGACGAAG 



ACGGCGGTTA 50 

AACGACTTCT 100 

GCTGGTGTAC 150 

CGCTGACGGG 200 

GTGCTGTTCT 250 

AGTGTCGGCG 3 00 

CGCTGGCTGA 350 

AAGGGCTCGA 400 

411 



2) INFORMATION FOR SEQ ID NO: 738 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 394 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium bifermentans 

(B) STRAIN: ATCC 638 



(xi) SEQUENCE DESCRIPTION: SEQ 

TACAAGAGCT TATTAACAAT ATAGCTACTC 
TTCGCAGGTG TTGGAGAGAG AACAAGAGAA 
GATGAGCGAT ACAGGAGTTA TAAATAAAAC 
TGAATGAGCC ACCTGGAGCA AGAATGAGAG 
ATGGCTGAAT ACTTCAGAGA TCAACAAGGG 
AGATAATATA TTCCGTTTCA CTCAAGCAGG 
TTGGACGTAC TCCATCAGCA GTTGGATACC 
ATGGGTAGAT TACAAGAGAG AATAACATCT 



ID NO: 738 



AACACGGTGG 


TATAT CAGTA 


50 


GGTAACGACT 


TATTCCATGA 


100 


AGCTCTAGTA 


TTCGGACAAA 


150 


TTGCTTTAAC 


TGGTCTTACA 


200 


CAAGACGTTT 


TATTATTCGT 


250 


ATCTGAGGTT 


TCTGCACTTC 


300 


AACCAACATT 


AGCAACAGAG 


350 


ACAAATAAAG 


GGTC 


394 



2) INFORMATION FOR SEQ ID NO: 739 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 94 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Clostridium beijerincki (deposited as* 
Clostridium butyricum) 

(B) STRAIN: ATCC 8260 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 739 

TTAATAAACA ACATAGCTAA ACAACATGGT GGTTTATCAG TATTTACTGG 50 

AGTTGGTGAA AGATCAAGAG AAGGTAATGA CTTATATCAT GAAATGAGAG 100 

AGTCAGGAGT TATTGATAAG ACAGCATTAG TATTTGGACA AATGAATGAG 150 

10 CCACCGGGTG CCAGAATGAG AGTTGCATTA ACAGGTCTTA CTATGGCAGA 200 

GTATTTTAGA GATAAAGGTC AAGATGTGTT ACTATTCATA GATAACATAT 250 

TCAGATATAC TCAAGCAGGT TCAGAGGTTT CAGCATTACT TGGAAGAACA 300 

CCTTCAGCGG TTGGATATCA GCCAACACTT GCAACTGAAA TGGGTGCACT 350 

TCAGGAAAGA ATTACATCAA CAGTTAATGG TTCTATTACG TCAG 394 

15. 



2) INFORMATION FOR SEQ ID NO: 740 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 393 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 
30 (B) STRAIN: ATCC 9689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 740 

TTATAAACAA TATTGCTAAG CAACATGGTG GTATTTCTGT ATTTTCAGGA 50 

35 GTAGGAGAAA GAACAAGAGA AGGTAACGAC CTTTATGGCG AAATGAGTGA 100 

GTCTGGAGTT ATAAATAAAA CAGCTCTAGT ATTTGGTCAA ATGAATGAAC 150 

CACCTGGAGC GAGAATGAGA GTTGCTTTAA CTGGACTTAC AATGGCAGAA 200 

CATTTTAGAG ATGAGCAAGG ACAAGACGTT TTACTTTTCG TTGATAATAT 250 

ATTCCGTTTC ACACAAGCTG GTTCAGAAGT TTCAGCACTT CTAGGACGTA 300 

40 TGCCATCAGC TGTTGGTTAT CAGCCAACAT TAGCTACTGA AATGGGTGCA 350 

CTTCAAGAGA GAATAACATC AACTAAGAAA GGTTCAATAA CAT 393 



45 2) INFORMATION FOR SEQ ID NO: 741 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 398 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium ramosum 

(B) STRAIN: ATCC 25582 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 741 

422 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



TTGATTCAAG AATTCATTAA 
AGTTTTTGCT GGAGTTGGTG 
ATGAAATGAA GGAAAGTGGT 
CAGATGAATG AACCCCCAGG 
TACTATGGCA GAAGAATTCC 
TCATCGATAA TATTTTCCGT 
TTACTTGGAC GGGTACCATC 
CGAAATGGGT GCTTTACAAG 



TAACATTGCT 
AACGTAGCCG 
GTTTTATCTA 
AGCTCGTTTA 
GTGATGAACA 
TTTACTCAAG 
ACAAGCTGGG 
AACGGATTAC 



ACAGAACATG GTGGTTTATC 
TGAAGGTAAT GATTTATATT 
AAACAACACT AGTATTTGGA 
AGAGTTGCTT TAACGGGTCT 
AGGTCAGGAT GTCTTATTAT 
CTGGATCTGA .AGTATCTGCC 
TATCAGCCAA CTTTAGCAAC 
ATCAACTAAA AAAGGATC 



50 
100 
150 
200 
250 
300 
350 
398 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 742 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 380 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium septicum 

(B) STRAIN: ATCC 12964 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 742 

TAGCTAAGGA ACACGGTGGA CTTTCAGTAT TCACAGGTGT TGGAGAAAGA 50 

TCAAGAGAAG GTAATGATTT ATATTACGAA ATGAAAGAAT CAGGAGTTAT 100 

AGACAAGACA GCTCTAGTGT TTGGACAAAT GAATGAATCT CCAGGAGCTA 150 

GAATGAGAGT ATCTTTAACA GGATTAACTA TGGCTGAATA TTTCAGAGAT 2 00 

CAAGGTCAAG ATGTGCTTTT ATTCATAGAT AACATATTTA GATTTACTCA 2 50 

AGCTGGATCA GAAGTATCGG CTTTACTTGG AAGAATACCA TCAGCAGTTG 3 00 

GTTATCAACC AACACTAGCA ACTGAAATGG GTGCACTTCA AGAAAGAATT 3 50 

ACTTCAACTA AAAATGGATC AATAACTTCA 3 80 



2 ) INFORMATION FOR SEQ ID NO: 743 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 389 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium tertium 

(B) STRAIN: ATCC 14573 

l (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 743 

TTAATAAATA ATATAGCAAA AGAGCATGGT GGTCTTTCTG TATTTACAGG 50 

AGTTGGAGAA AGGTCAAGAG AAGGTAACGA CTTATATTAT GAAATGAAAG 100 

AGTCAGGGGT TATAGATAAG ACAGCTTTAG TATTTGGACA AATGAATGAA 150 

TCACCAGGAG CAAGAATGAG AGTTTCATTA ACTGGATTAA CTATGGCTGA 200 

ATATTTTAGA GATCAAGGTC AAGACGTTCT TTTATTTATA GATAATATAT 250 

TTAGATTTAC TCAAGCGGGA TCAGAAGTTT CTGCGTTATT AGGAAGAATT 3 00 



423 
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CCTTCAGCAG TTGGATATCA ACCAACTCTT GCAACTGAAA TGGGAGCACT 3 St) 

TCAAGAAAGA ATAACATCAA CAAAGAATGG ATCAATCAC 389 



2) INFORMATION FOR SEQ ID NO: 744 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 843 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Comamonas acidovorans 

(B) STRAIN: ATCC 15668 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 744 

TTCCCCCGCA CGCATGCCCA AGGTGTTCGA TGCCCTGAAG CTCGACGGCT 50 

CGGCCCTGAC GCTGGAAGTG CAGCAACTGC TGGGTGACGG CGTTGTGCGT 100 

ACCATCGCCC TGGGTTCGTC CGACGGTCTG CGTCGCGGCC TGATGGTGTC 150 

25 CAACACCGGC AACCCCATCA CCGTGCCCGT GGGCAAGGCG ACGCTGGGTC 200 

GCATCATGGA CGTGCTGGGC AATCCCATCG ACGAACGTGG TCCCGTGGAT 250 

CAGGCGCTGA CGGCTCCCAT CCACCGCAAG GCACCGGCTT ATGACGAGCT 300 

GTCGCCTTCG CAGGAACTGC TGGAAACCGG CATCAAGGTG ATCGACCTGA 350 

TCTCGCCCTT CGCCAAGGGC GGCAAGGTGG GTCTGTTCGG TGGCGCCGGT 400 

30 GTGGGCAAGA CCGTGAACAT GATGGAACTC ATCAACAACA TCGCCAAGGG 450 

CCACGGTGGT CTGTCGGTGT TCGCCGGTGT GGGTGAACGT ACCCGCGAAG 500 

GCAATGACTT CTATCACGAA ATGTCGGACG CCGGCGTGGT CAACCAGGAG 550 

TCGCTGAACG ACTCCAAGGT GGCCATGGTC TACGGCCAGA TGAACGAACC 600 

CCCGGGCAAC CGTCTGCGCG TGGCGCTGAC CGGCCTGACC ATGGCCGAAG 650 

35 CCTTCCGTGA CGAAGGCAAG GACGTGCTGT TCTTCGTGGA CAACATCTAC 700 

CGCTACACGC TGGCCGGTAC CGAAGTGTCC GCTCTGCTGG GTCGCATGCC 750 

TTCCGCCGTG GGCTACCAGC CCACGCTGGC CGAGGAAATG GGCCGCCTGC 800 

AAGAGCGCAT CACCTCGACC AAGGTCGGTT CGATCACTTC CAC 843 

40 

2) INFORMATION FOR SEQ ID NO: 745 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. 
rhinoscleromatis 
55 (B) STRAIN: ATCC 13884 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 745 

GCCGTACCAC GCGTGTACGA AGCCCTTGAG GTACAGAATG GTAATGAAGT 50 
60 TCTGGTGCTG GAAGTTCAGC AGCAGCTGGG CGGCGGTATC GTACGTACCA 100 

424 
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TCGCCATGGG TTCTTCTGAT GGTCTGCGCC 
CTCGAGCACC CGATCGAAGT CCCGGTAGGT 
CATGAACGTA CTGGGTCAAC CGGTTGACAT 
AAGAGCGTTG GGCTATCCAC CGCGCGGCAC 
5 AGCTCTCAGG AACTGCTGGA AACCGGCATC 
TCCGTTCGCC AAGGGCGGTA AAGTTGGTCT 
GTAAAACTGT AAACATGATG GAGCTGATCC 
TCCGGTTACT CTGTGTTTGC GGGCGTAGGT 
CGACTTCTAC CACGAAATGA CCGACTCCAA 

10 TGGTGTACGG CCAGATGAAC GAGCCGCCGG 
CTGACCGGCC TGACCATGGC TGAGAAATTC 
ACTGCTGTTC GTCGATAACA TCTATCGTTA 
TATCCGCGCT GCTGGGTCGT ATGCCTTCAG 
CTGGCGGAAG AGATGGGCGT TCTGCAGGAA 

15 CGGTTCTATC ACCTCCGTA 



PCT/CA00/01150 

GCGGTCTGGA TGTXAAAGAC 450 

AAAGCAACGC TGGGTCGTAT 2 00 

GAAAGGCGAC ATCGGCGAAG 250 

CGTCCTATGA AGAGCTGTCC 3 00 

AAAGTTATCG ACCTGATGTG 3 50 

GTTCGGCGGT GCGGGTGTAG 4 00 

GTAACATCGC GATCGAGCAC 450 

GAGCGTACTC GTGAGGGTAA 500 

CGTTATCGAT AAAGTATCCC 550 

GAAACCGTCT GCGCGTTGCG 600 

CGTGACGAAG GTCGTGACGT 650 

CACCCTGGCC GGTACTGAAG 700 

CGGTAGGTTA TCAGCCGACC 750 

CGTATCACCT CCACCAAAAC 800 

819 



2) INFORMATION FOR SEQ ID NO: 746 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Neisseria canis 

(B) STRAIN: ATCC 14687 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 746 

35 GCGATTCTAT TCCGCGCGTA TATGATGCTC TTAAACTAGT GGATAGAGAA 50 

CTGACGCTTG AAGTACAACA ACAGTTGGGT GATGGTGTCG TTCGTACTAT 100 

TGCGATGGGT AGTTCCGACG GCCTCAAACG AGGTTTGGCG GTAGTTAACA 150 

CCGGTGCTCC AATTACAGTG CCTGTGGGCA AAGCAACATT AGGCCGTATT 200 

ATGGACGTAT TAGGTAATCC GGTTGATGAA GCTGGTCCGA TTGGCTCCGA 250 

40 GCAAACCCGA GCAATCCACC AACCTGCTCC TAAGTTCGAC GAGCTTTCTA 300 

GCGCCACAGA GATTTTGGAA ACAGGTATTA AAGTAATTGA TTTGCTTTGC 350 

CCGTTTGCCA AAGGCGGTAA AGTAGGTTTG TTTGGTGGTG CGGGAGTGGG 400 

CAAAACCGTA AATATGATGG AGTTGATTAA CAACATCGCG AAAGCACACA 450 

GCGGTTTGTC TGTATTTGCC GGTGTGGGTG AACGGACGCG TGAAGGTAAT 500 

45 GACTTTTATC ATGAGATGAA AGATTCCAAT GTATTAGATA AAGTAGCCAT 550 
GGTTTACGGT CAGATGAACG AGCCTCCCGG TAACCGTTTG CGCGTTGCGC ■ 600 

TAACTGGCTT GTCTATGGCC GAATTCTTCC GTGACGAGAA AGATGAAAAC 650 

GGTAAAGGCC GTGATGTATT GTTCTTTGTA GACAATATTT ACCGCTATAC 700 

CCTAGCCGGT ACAGAAGTAT CTGCATTGCT TGGCCGTATG CCTTCGGCAG 750 

50 TAGGTTATCA GCCGACGTTG GCAGAGGAAA TGGGCCGCTT GCAAGAGCGT 800 

ATTACYTCAM CCCAAACAGG CTCT 824 



55 2) INFORMATION FOR SEQ ID NO: 747 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 

425 
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(D) TOPOLOGY: Linear 



PCT/CA00/01150 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria cinerea 

(B) STRAIN: ATCC 14685 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 747 



10 



CGCGACGCTA TCCCGCATGT TTACGATGCC CTGAAATTGG ACGAGAACGG 50 

TCTGACTCTG GAGGTTCAAC AGCTTCTGGG CGACGGCGTT GTCCGTACTA 100 

TTGCAATGGG TAGTTCAGAC GGCCTTAAAC GCGGTATGTC TGTAAGCAAT 150 

ACTGGTGCGC CAATCACTGT GCCGGTAGGT AAAGGTACAT TGGGTCGTAT 2 00 

15 TGTCGACGTA TTGGGTACGC CTGTTGATGA AGCAGGTCCG ATCGATACCG 250 

ACAAAAGCCG TGCCATTCAC CAAACTGCTC CGAAATTCGA CGAGTTGTCT 300 

TCAGCTACCG AATTGTTGGA AACCGGTATT AAAGTGATCG ACTTGCTGTG 350 

TCCGTTTGCT AAAGGCGGTA AAGTAGGTCT GTTCGGTGGT GCCGGTGTGG 400 

GCAAAACCGT GAACATGATG GAATTGATCA ACAACATCGC CAAAGCGCAC 450 

20 AGCGGTCTGT CCGTGTTCGC AGGTGTGGGT GAGCGTACCC GTGAAGGTAA 500 

CGACTTCTAC CACGAGATGA AAGATTCCAA CGTATTGGAT AAAGTAGCCA 550 

TGGTGTATGG CCAAATGAAC GAACCTCCGG GCAACCGTCT GCGCGTTGCT 600 

TTGACCGGTT TGACTATGGC CGAATACTTC CGTGACGAAA AAGACGAAAA 650 

CGGTAAAGGC CGCGACGTAT TGTTCTTCGT TGACAACATC TACCGTTACA 700 

25 CTTTGGCCGG TACTGAAGTA TCTGCACTGT TGGGCCGTAT GCCTTCTGCA 750 

GTGGGTTACC AACCGACATT GGCTGAAGAA ATGGGTCGTT TGCAAGAGCG 800 

TATTACCTCT ACCCAAACCG GTTCCATTAC T 831 



30 



40 



2 ) INFORMATION FOR SEQ ID NO: 748 



(i) SEQUENCE CHARACTERISTICS; 
(A) LENGTH: 862 bases 
35 (B) TYPE: Nucleic acid 

' (C) STRANDEDNESS : Double 
(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria cuniculi 

(B) STRAIN: ATCC 14688 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 748 

CCGTGGCCAA GTACCACAAA TTTATGACGC ACTGAGTGTT GATGGCACCG 50 

AAACAACCTT GGAAGTTCAA CAGCAGTTGG GTGATGGCGT GGTGCGTACC 100 

ATTGCGATGG GTTCAACCGA AGGCTTGAAG CGTGGTTTGA CTGTATCTAA 150 

50 CTCTGGTGCA CCGATTTCTG TGCCAGTGGG TCAAGCGACT TTGGGTCGTA 200 

TTATGGATGT GTTGGGTCGT CCAATCGACG AGGCAGGTCC TGTAAATGCT 250 

CAAGAAAAAT GGTCAATTCA CCGTGAAGCA CCAAGCTATG ATGAGCAATC 300 

AAACTCAACT GAGCTGCTAG AAACAGGCAT CAAAGTAATT GATTTGCTTT 350 

GCCCATTTGC TAAAGGTGGT AAAGTTGGCT TGTTCGGTGG TGCAGGTGTG 400 

55 GGTAAAACCG TGAATATGAT GGAGCTGATT AATAATATCG CTCTGAAGCA 450 

TTCAGGTCTT TCTGTTTTTG CAGGTGTTGG TGAGCGTACT CGTGAGGGTA 500 

ACGATTTTTA TCACGAAATG CAAGAAGCAG GCGTTGTTAA TACCGAAGAC 550 

TTCACCAAGT CAAAAGTAGC GATGGTTTAT GGTCAGATGA ATGAGCCACC 600 

AGGAAACCGT TTGCGTGTTG CATTGACAGG CTTGACGATG GCAGAATATT 650 

60 TCCGTGACGA AAAAGATGAA GCAACAGGCA AAGGGCGTGA TGTTCTATTG 700 

426 
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TTCGTTGATA ACATCTATCG 
ACTTCTAGGT CGTATGCCAT 
AAGAAATGGG TGCGTTGCAA 
ATCACTTCGG GG 



TTACACACTG GCTGGTACGG 
CAGCAGTAGG TTATCAACCG 
GAGCGTATTA CCTCAACGCA 



PCT/CA00/01150 

AAGTGTCAGC 750' 
ACTCTGGCTG 800 
ATCGGGTTCG 850 

862 



2) INFORMATION FOR SEQ ID NO: 74 9 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria elongata subsp. elongata 
20 (B) STRAIN: ATCC 25295 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 749 

GGAACTCCCA CGTGACGCTA TCCCGCATGT TTTTGATGCA TTAAAATTAG 50 

25 TTGAAAATGA CCTAACCTTA GAAGTTCAAC AACTTTTGGG GGATGGTGTA 100 

GTGCGTACCA TTGCGATGGG TAGTTCAGAT GGATTAAAGC GTGGTATGGC 150 

TGTGAATAAT ACCGGAGCTC CGATTACTGT TCCTGTTGGC CGTGAAACTT 2 00 

TGGGTCGTAT CATGGATGTA TTGGGTAATC CGGTTGATGA GGCAGGTCCG 250 

GTAAATGCAT CCAATACACG TGCGATCCAT CAAGAGGCTC CTAAGTTTGA 300 

30 TGAGCTTTCT TCAACAACGG AATTATTAGA AACTGGCATT AAGGTTATCG 350 

ACTTGTTATG TCCGTTTGCC AAAGGTGGTA AAGTAGGTCT GTTTGGTGGT 400 

GCGGGTGTAG GTAAAACCGT AAATATGATG GAGTTAATTA ACAAC ATTG C 450 

CAAGGCACAT AGCGGTTTGT CTGTGTTTGC AGGCGTGGGT GAACGTACTC 500' 

GTGAAGGTAA TGACTTCTAT CACGAGATGA AAGATTCCAA CGTATTGGAC 550 

35 AAAGTGGCAA TGGTTTACGG TCAGATGAAC GAACCTCCAG GCAACCGTCT 600 

GCGCGTTGCT TTGACCGGTT TGACTATGGC CGAATACTTC CGTGACGAAA 650 

AAGACGAAAA CGGTAAAGGT CGCGACGTAT TGTTCTTCGT GGACAACATT 700 

TACCGTTACA CTTTGGCCGG TACGGAAGTA TCCGCATTGC TGGGTCGTAT 750 

GCCTTCAGCA GTAGGTTACC AACCGACATT GGCTGAAGAA ATGGGTCGTT 800 

40 TGCAAGAGCG TATTACCTCT ACCCAGACAG GCTCTATTAC TTCC 844 



2) INFORMATION FOR SEQ ID NO: 750 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Neisseria flavescens 

(B) STRAIN: ATCC 1312 0 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 750 

60 CGCGACGCTA TTCCGCATGT TTACGATGCC CTGAAATTGG ACGAGAACGG 50 

427 
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TCTGACTCTG GAAGTTCAAC AGCTTCTGGG TGACGGCGTT GTCCGT&CTA 100' 

TTGCAATGGG TAGTTCAGAC GGCCTGAAAC GCGGCATGTC TGTAAGCAAT 150 

ACCGGTGCGC CAATCACTGT GCCGGTAGGT AAAGGTACAT TGGGCCGTAT 200 

TGTCGACGTA TTGGGTACGC CTGTTGATGA AGCAGGTCCG ATCGATACCG 250 

5 ACAAGAGCCG TGCCATTCAC CAAACTGCTC CGAAATTCGA TGAGTTGTCT 300 

TCAACTACCG AATTGTTGGA AACCGGTATT AAAGTGATCG ACTTGCTGTG 350 

TCCGTTTGCT AAGGGCGGTA AAGTAGGTCT GTTCGGTGGT GCCGGTGTGG 400 

GCAAAACCGT GAACATGATG GAATTGATCA ACAACATCGC CAAAGCGCAC 450 

AGCGGCCTGT CCGTGTTCGC AGGTGTGGGT GAACGTACCC GTGAAGGTAA 500 

10 CGACTTCTAC CACGAGATGA AAGATTCCAA CGTATTGGAT AAAGTAGCCA 550 

TGGTGTATGG TCAAATGAAC GAACCTCCAG GCAACCGTCT GCGCGTTGCT 600 

TTGACCGGTT TGACTATGGC CGAATACTTC CGTGACGAAA AAGATGAAAA 650 

CGGTAAAGGT CGCGACGTAT TGTTCTTCGT TGACAACATC TAC CGTTACA 700 

CTCTGGCCGG TACCGAAGTA TCCGCACTGT TGGGTCGTAT GCCTTCTGCA 750 

15 GTGGGTTACC AACCGACATT GGCTGAAGAA ATGGGTCGTT TGCAGGAGCG 800 

TATTACTTCT ACGCAAACCG GTTCCATTAC TTCC 834 



20 2) INFORMATION FOR SEQ ID NO: 751 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear „ 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria gonorrhoeae 

(B) STRAIN: ATCC 31426 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 751 



CGCGACATGA TTCCGCGCGT TTACGACGCT TTGAAATTAG ACGAAAACGG 50 

TCTGACTTTG GAAGTCCAAC AGCTTTTGGG TGATGGCGTA GTCCGTACCA 100 

TCGCTATGGG CAGCTCGGAC GGTTTGAAAC GCGGCATGAC TGTGAGCAAT 150 

ACTGGTTCGC CCATTACTGT GCCGGTAGGT AAAGGTACGT TGGGACGCAT 200 

40 TGTCGATGTA TTGGGAACTC CTGTTGACGA GGCAGGTCCA ATTGATACCG 250 

ACAAGAGTCG TGCCATCCAC CAAGCCGCTC CTAAGTTTGA CGAACTGTCT 3 00 

TCCACAACCG AATTGCTCGA AACGGGCATT AAAGTGATTG ACTTGCTGTG 350 

TCCGTTTGCC AAAGGCGGTA AAGTAGGTCT GTTCGGCGGT GCCGGTGTGG 4 00 

GTAAAACCGT GAACATGATG GAATTGATCA ACAACATCGC CAAAGCGCAC 450 

45 AGCGGCTTGT CCGTGTTCTC AGGCGTGGGT GAGCGTACCC GCGAAGGTAA 500 

CGACTTCTAC CACGAGATGA AAGATTCCAA CGTATTGGAT AAAGTAGCCA 550 

TGGTGTATGG CCAAATGAAC GAACCTCCAG GCAACCGTCT GCGCGTTGCT 600 

TTGACCGGTT TGACTATGGC TGAATACTTC CGTGACGAAA AAGACGAAAA 650 

CGGTAAAGGT CGTGACGTAT TGTTCTTCGT TGACAACATC TACCGTTACA 700 

50 CTCTGGCCGG TACCGAAGTA TCCGCACTGT TGGGCCGTAT GCCTTCTGCA 750 

GTGGGTTACC AACCGACATT GGCTGAAGAA ATGGGTCGTT TGCAAGAGCG 800 

TATTACCTCT ACCCAAACCG GTTCCATTAC TTCC 834 



55 

2) INFORMATION FOR SEQ ID NO: 752 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 825 bases 
60 *(B) TYPE: Nucleic acid 

428 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria gonorrhoeae 

(B) STRAIN : ATCC 27628 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 752 

ACATGATTCC GCGCGTTTAC GACGCTTTGA AATTAGACGA AAACGGTCTG 50 

ACTTTGGAAG TCCAACAGCT TTTGGGTGAT GGCGTAGTCC GTACCATCGC 100 

TATGGGCAGC TCGGACGGTT TGAAACGCGG CATGACTGTG AGCAATACTG 150 

15 GTTCGCCCAT TACTGTGCCG GTAGGTAAAG GTACGTTGGG ACGCATTGTC 200 

GATGTATTGG GAACGCCTGT TGACGAGGCA GGTCCAATTG ATACCGACAA 250 

GAGTCGTGCC ATCCACCAAG CCGCTCCTAA GTTTGACGAA CTGTCTTCCA 3 00 

CAACCGAATT GCTCGAAACG GGCATTAAAG TGATTGACTT GCTGTGTCCG 350 

TTTGCCAAAG GCGGTAAAGT AGGTCTGTTC GGCGGTGCCG GTGTGGGTAA 4 00 

20 AACCGTGAAC ATGATGGAAT TGAT CAACAA CATCGCCAAA GCGCACAGCG 450 

GCTTGTCCGT GTTCTCAGGC GTAGGTGAGC GTACCCGCGA AGGTAACGAC 500 

TTCTACCACG AGATGAAAGA TTCCAACGTA TTGGATAAAG TAGCCATGGT 550 

GTATGGCCAA ATGAACGAAC CTCCAGGCAA CCGTCTGCGC GTTGCTTTGA 600 

CCGGTTTGAC TATGGCTGAA TACTTCCGTG ACGAAAAAGA CGAAAACGGT 650 

25 AAAGGTCGTG ACGTATTGTT CTTCGTTGAC AACATCTACC GTTACACTCT 700 

GGCCGGTACC GAAGTATCCG CACTGTTGGG CCGTATGCCT TCTGCAGTGG 750 

GTTACCAACC GACATTGGCT GAAGAAATGG GTCGTTTGCA AGAGCGTATT 800 

ACCTCTACCC AAACCGGTTC CATTA 825 



30 



2) INFORMATION FOR SEQ ID NO: 753 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria lactamica 

(B) STRAIN: ATCC 23970 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 753 



GATGCGATTC CGCATGTTTA CGATGCCCTG AAATTGGACG AGAACGGTCT 50 

GACCCTGGAA GTCCAACAGC TTTTGGGTGA CGGCGTTGTC CGTACTATTG 100 

50 CAATGGGTAG TTCAGACGGC CTGAAACGCG GCATGTCTGT CAGCAATACC 150 

GGTGCGCCAA TCACTGTGCC GGTAGGTAAA GGTACGTTGG GCCGTATTGT 2 00 

CGACGTATTG GGTACACCTG TTGACGAAGC AGGTCCGATC GATACCGACA 250 

AGAGCCGCGC CATCCACCAA ACCGCCCCGA AATTCGACGA GTTGTCTTCA 300 

ACTACCGAAT TGTTGGAAAC CGGCATTAAA GTGATCGATT TGCTGTGTCC 350 

55 GTTTGCTAAG GGCGGTAAAG TAGGTCTGTT CGGTGGTGCC GGTGTGGGCA 400 

AAACCGTGAA CATGATGGAA TTGATCAACA ACATCGCCAA AGCGCACAGC 450 

GGTCTGTCCG TGTTTGCAGG CGTGGGCGAG CGTACCCGCG AAGGTAACGA 500 

CTTCTACCAC GAGATGAAAG ATTCCAACGT ATTGGATAAA GTAGCCATGG 550 

TGTATGGTCA GATGAACGAA CCTCCGGGCA ACCGTCTGCG CGTTGCTTTG 600 

60 ACCGGTTTGA CGATGGCCGA ATACTTCCGC GACGAAAAAG ACGAAAACGG 650 

429 
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PCT/CAO0/01150 



CAAAGGCCGC GACGTATTGT TCTTCGTGGA 
TGGCCGGTAC CGAAGTATCC GCACTGTTGG 
GGTTACCAAC CGACATTGGC TGAAGAAATG 
TACCTCTACC CAAACCGGTT CCATTACTTC 



CAACATCTAC CGTTACACCC 
GCCGTATGCC TTCCGCAGTG 
GGTCGTTTGC AAGAGCGTAT 
C 



7O0 
750 
800 
831 
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2) INFORMATION FOR SEQ ID NO: 754 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 836 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: 2241C 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 754 

CCACGCGACA TGATTCCGCG CGTTTACGAC GCTTTGAAAT TAGACGAAAA 50 

CGGTCTGACT TTGGAAGTCC AACAGCTTTT GGGCGACGGC GTAGTCCGTA 100 

CCATTGCGAT GGGCAGCTCG GACGGTTTGA AACGCGGCAT GACTGTGAGC 150 

AATACCGGTG CGCCCATTAC TGTGCCGGTA GGTAAAGGTA CGTTGGGACG 200 

CATTGTCGAT GTATTGGGAA CGCCTGTTGA CGAGGCAGGT CCAATCGATA 250 

CCGACAAGAG CCGTGCCATC CACCAAGCCG CTCCTAAGTT TGACGAACTG 300 

TCTTCCACAA CCGAATTGCT CGAAACGGGC ATTAAAGTGA TTGACTTGCT 350 

GTGTCCGTTT GCCAAAGGCG GTAAAGTAGG TCTGTTCGGC GGTGCCGGTG 4 00, 

TGGGTAAAAC CGTGAACATG ATGGAATTGA TCAACAACAT CGCCAAAGCG 450 

CACAGCGGCT TGTCCGTGTT CGCAGGCGTG GGTGAGCGTA CCCGCGAAGG 500: 

TAACGACTTC TACCACGAGA TGAAAGATTC CAACGTATTG GATAAAGTGG 550 

CAATGGTTTA CGGTCAGATG AACGAACCTC CGGGCAACCG TTTGCGCGTC 600 

GCATTGACCG GTTTGACCAT GGCGGAATAC TTCCGTGACG AAAAAGACGA 650 

AAACGGCAAA GGCCGCGACG TATTGTTCTT CGTGGACAAC ATCTACCGTT 700 

ACACTCTAGC TGGTACCGAA GTATCCGCAT TGTTGGGCCG TATGCCGTCT 750 

GCAGTGGGCT ACCAACCGAC ATTGGCAGAA GAAATGGGTC GTTTGCAGGA 800 

GCGTATTACC TCTACCCAAA CCGGTTCTAT TACTTC 836 



2) INFORMATION FOR SEQ ID NO: 755 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 837 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria mucosa 

(B) STRAIN: ATCC 19696 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 755 
CCGCGTGATG CCATTCCGCG TGTTTACGAC GCCCTGAAAT TGGATGCAAA 50 
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CGGCCTGACT TTGGAAGTGC AACAGCTTCT 
CTATTGCAAT GGGTAGTTCG GATGGTTTGA 
AATACAGGTG CGCCGATTAC AGTACCGGTA 
TATTGTCGAT GTATTGGGTA CGCCTGTTGA 
5 CCGACAAACA CCGTGCCATC CATCAGACAG 
TCTGCGACTA CTGAGCTGCT GGAAACCGGC 
GTGTCCGTTT GCCAAAGGCG GTAAAGTAGG 
TAGGCAAAAC CGTCAACATG ATGGAATTAA 
CATAGCGGTT TGTCCGTGTT TGCAGGTGTG 

10 TAATGACTTC TACCACGAGA TGAAAGATTC 
CGATGGTTTA CGGTCAGATG AACGAACCTC 
GCCTTGACCG GTTTGACGAT GGC CGAATAC 
AAGCGGCAAA GGCCGTGACG TATTGTTCTT 
ACACCCTGGC CGGTACCGAA GTATCCGCAT 

15 GCAGTAGGTT ACCAACCGAC ATTGGCTGAA 
GCGTATTACC TCTACCCAAA CAGGCTCCAT 



GGGTGACGGC GTAGTTCGTA ZOO 

AACGCGGCAT GACTGTAAGC 150 

GGTAAAGGTA CTTTGGGACG 200 

CGAAGCAGGT CCGATTGATA 250 

CTCCGAAATT TGATGAACTG 3 00 

ATTAAAGTGA TTGACTTGCT 350 

TCTGTTCGGT GGTGCCGGTG 400 

TTAACAACAT CGCCAAAGCA 450 

GGTGAGCGTA CCCGTGAAGG 500 

CAACGTATTG GACAAAGTGG 550 

CGGGTAACCG TCTGCGTGTA 600 

TTCCGTGATG AAAAAGACGA 650 

CGTGGACAAT ATCTACCGTT 700 

TGTTGGGTCG TATGCCTTCA 750 

GAAATGGGTC GTTTGCAAGA 800 

TACCTCC 837 



20 2) INFORMATION FOR SEQ ID NO: 756 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria sub f lava 

(B) STRAIN: ATCC 14221 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 756 



CGCGACGCTA TCCCGCATGT TTACGATGCC CTGAAATTGG ACGAGAACGG 50 

TCTGACTCTG GAAGTTCAAC AGCTTCTGGG TGACGGCGTT GTCCGTACTA 100 

TTGCAATGGG TAGTTCAGAC GGCCTGAAAC GCGGCATGTC TGTAAGCAAT 150 

ACTGGTGCGC CAATCACTGT GCCGGTAGGT AAAGGTACAT TGGGTCGTAT 200 

40 TGTCGACGTA TTGGGTACGC CTGTTGATGA AGCAGGTCCG ATCGATACCG 250 

ACAAGAGCCG TGCCATTCAC CAAACTGCTC CGAAATTCGA CGAGTTGTCT 300 

TCAACTACCG AATTGCTGGA AACCGGTATT AAAGTGATCG ACTTGCTGTG 350 

TCCGTTTGCT AAGGGCGGTA AAGTAGGTCT GTTCGGTGGT GCCGGTGTGG 400 

GCAAAACCGT GAACATGATG GAATTGATCA ACAACATCGC CAAAGCGCAC 450 

45 AGCGGTCTGT CCGTGTTCGC AGGTGTGGGT GAACGTACCC GTGAAGGTAA 500 

CGACTTCTAC CACGAGATGA AAGATTCCAA CGTATTGGAT AAAGTAGCCA 550 

TGGTGTATGG CCAAATGAAC GAACCTCCGG GCAACCGTCT GCGCGTTGCT 600 

TTGACCGGTT TGACTATGGC CGAATACTTC CGTGACGAAA AAGACGAAAA 650 

CGGTAAAGGT CGCGACGTAT TGTTCTTCGT TGACAACATC TACCGTTACA 700 

50 CTCTGGCCGG TACCGAAGTA TCTGCACTGT TGGGCCGTAT GCCTTCTGCA 750 

GTGGGTTACC AACCGACATT GGCTGAAGAA ATGGGTCGTT TGCAAGAGCG 800 

TATTACCTCT ACCCAAACTG GTTCCATTAC TTCC 834 



55 

2 ) INFORMATION FOR SEQ ID NO: 757 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 833 bases 
60 (B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Neisseria weaveri 

(B) STRAIN: ATCC 51223 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 757 

CCCGTGATGC TATTCCCATG TATACGATGC CCTGAAATTG GTAGATAACG 50 

ATCTGACCCT GGAAGTGCAA CAACTTTTAG GTGATGGTGT GGTTCGTACC 100 

ATTGCAATGG GTAGTTCAGA CGGCCTAAAA CGTGGTATGG CTGTTAACAA 150 

15 TACCGGCGCT CCGATTACTG TTCCGGTGGG GAAAGCCACC TTGGGACGTA 200 

TTATGGATGT GTTGGGTAAT CCGGTTGATG AAGCAGGTCC TGTTGTATCA 250 

GAAGAAACTC GCGCTATTCA TCAAGCTGCC CCTAAATTTG ACGAACTGTC 300 

TTCAGCAACT GAGTTGTTGG AAACAGGCAT TAAAGTAATT GACTTGCTGT 350 

GCCCGTTTGC CAAAGGTGGT AAAGTAGGTT TGTTTGGTGG TGCCGGCGTG 400 

2 0 GGTAAAACCG TAAATATGAT GGAGTTGATC AACAACATCG CGAAGGCACA 450 

TAGTGGTTTG TCTGTATTCG CCGGTGTAGG TGAGCGTACC CGTGAAGGTA 500 

ACGACTTCTA CCATGAAATG AAAGACTCTA ACGTATTGGA TAAAGTAGCC 550 

ATGGTTTATG GCCAGATGAA TGAACCTCCG GGTAACCGTT TGCGCGTTGC 600 

TTTGACTGGT TTGACTATGG CCGAATATTT CCGTGACGAG AAAGATGAAA 650 

25 ACGGCAAAGG TCGTGACGTC TTGTTCTTTG TGGATAATAT CTATCGCTAT 700 

ACTCTGGCCG GTACTGAAGT GTCTGCACTG TTAGGTCGTA TGCCGTCTGC 750 

AGTAGGTTAT CAGCCTACAT TGGCAGAAGA AATGGGTCGC TTGCAGGAGC 800 

GTATTACTTC TACTCAAACA GGTTCGATTA CTT 833 



30 
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2) INFORMATION FOR SEQ ID NO: 758 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria animalis 

(B) STRAIN: ATCC 19573 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 758 



CGCGATGCCA TTCCTCACGT TTACGATGCC CTGAAATTGG ACGACACCGG 50 

TCTGACTTTG GAAGTACAAC AACTTCTGGG CGACGGTGTG GTACGTACCA 100 

50 TTGCAATGGG TAGTTCAGAC GGCCTGAAAC GGGGTTTGTC TGTGAGCAAT 150 

ACCGGTTCTC CGATTGCCGT TCCTGTCGGT AAAGCGACTT TGGGTCGTAT 2 00 

TATGGACGTA TTGGGCAATC CGGTTGATGA AGCCGGTCCG GTTGCTACCG 250 

AAGAGAAACG TGCTATTCAC CAAGCCGCAC CGAAGTTTGA TGAATTGTCA 300 

TCAGCTACCG AGTTGTTGGA AACCGGTATT AAAGTAATCG ACTTGCTGTG 350 

55 TCCGTTTGCA AAAGGCGGTA AAGTAGGTCT GTTCGGCGGT GCCGGTGTGG 400 

GCAAAACCGT AAACATGATG GAATTGATCA ACAACATCGC CAAAGCACAC 450 

AGCGGTCTGT CTGTGTTTGC CGGTGTAGGT GAACGTACCC GCGAAGGTAA 500 

CGACTTCTAC CACGAGATGA AAGATTCCAA CGTGTTGGAT AAAGTAGCCA 550 

TGGTGTACGG TCAGATGAAT GAGCCGCCGG GCAACCGCTT GCGCGTGGCT 600 

60 TTGACCGGCC TGACTATGGC CGAATACTTC GTGACGAAAA AGACGAAAAC 650 
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GGCAAAGGTC GTGACGTATT GTTCTTCGTG 
ACTGGCCGGT ACCGAAGTAT CAGCATTGTT 
TAGGTTATCA GCCGACATTG GCAGAGGAAA 
ATTACCTCTA CCCAAACCGG TTCGATTACC 



GACAACATTT ACCGCTACAC 
GGGCCGTATG CCGTCTGCAG 
TGGGTCGCTT GCAAGAGCGT 
TCT 



70O 
750 
800 
833 
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2) INFORMATION FOR SEQ ID NO: 759 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus penneri 

(B) STRAIN: ATCC 33519 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 759 

AGCGTCCCTA AAGTATACGA CGCTCTTGAG GTTATGAATG GTAAAGAAAA 50 

ACTGGTGCTG GAAGTTCAGC AACAGTTAGG CGGTGGTATC GTTCGTTGTA 100 

TCGCAATGGG TACATCAGAC GGTTTAAGCC GTGGCTTAAA AGTTGAAGAT 150 

TTAGGCCACC CAATTGAAGT ACCAGTAGGT AAAGCAACAC TGGGACGTAT 200 

CATGAACGTT CTGGGTACAC CTATTGATAT GAAAGGTGAT ATTGCAACTG 250 

AAGAACGTTG GTCTATTCAC CGTGAAGCAC CAACCTACGA AGAGTTATCA 300 

AACTCACAAG AACTGCTTGA AACCGGTATC AAAGTAATGG ACTTAATCTG 350 

TCCGTTTGCT AAAGGTGGTA AAGTAGGTCT 'GTTCGGTGGT GCGGGTGTGG 400 

GTAAAACAGT TAACATGATG GAATTGATCC GTAATATCGC GATCGAGCAC 450 

TCAGGTTACT CTGTATTTGC TGGTGTTGGT GAGCGTACTC GTGAGGGTAA 500 

CGACTTCTAT CATGAAATGA CAGATTCTAA CGTTCTTGAC AAAGTATCGT 550 

TAGTTTATGG TCAGATGAAT GAGCCACCAG GAAACCGTCT GCGTGTAGCA 600 

CTGACTGGTC TGACTATGGC TGAAAAATTC CGTGATGAAG GCCGTGACGT 650 

ACTGTTATTC GTCGATAACA TTTATCGTTA CACCTTAGCC GGTACAGAAG 700 

TATCAGCACT GTTAGGTCGT ATGCCATCAG CGGTAGGTTA CCAGCCAACA 750 

TTGGCTGAAG AGATGGGTGT TCTGCAAGAA CGTATCACTT CAACCAAAAC 800 

AGGTTCAATC ACCTCTGTA 819 



2) INFORMATION FOR SEQ ID NO: 760 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Enteritidis 

(B) STRAIN: ATCC 13076 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 760 
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GCCGTACCAC GCGTGTACGA TGCCCTTGAG GTGCAGAATG GTAATGAGAA* <£o 

GCTGGTGCTG GAAGTTCAGC AGCAGCTTGG CGGCGGTATC GTGCGTACCA 100 

TCGCGATGGG GTCTTCTGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAT 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCCACGC TGGGTCGTAT 200 

5 CATGAACGTC CTGGGCGAAC CGGTCGACAT GAAAGGCGAG ATCGGCGAAG 250 

AAGAGCGTTG GGCGATTCAC CGCGCAGCAC CTTCCTACGA AGAGTTGTCA 300 

AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACCGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 450 

10 TCCGGTTACT CAGTGTTTGC GGGCGTAGGG GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAT AAAGTATCCC 550 

TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCA 600 

CTGACCGGCC TGACCATGGC GGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 700 

15 TATCCGCACT GCTGGGCCGT ATGCCTTCCG CAGTAGGTTA CCAGCCGACT 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAAAC 800 

CGGTTCTATC ACCTCCGTA 819 



20 



2) INFORMATION FOR SEQ ID NO: 761 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 812 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia pestis 

(B) STRAIN: KIM D27* 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 761 

AC CAAAAGTG TACAACGCCC TTGAGGTAGA AGGTACAACT GAAAAGTTAG 50 

TGCTGGAAGT TCAGCAACAG TTGGGCGGTG GTGTTGTTCG TTGTATCGCA 100 

ATGGGCTCTT CCGATGGTCT GAGCCGTGGG TTGAAAGTAA CCAACCTAGA 150 

40 ACACCCGATC GAAGTACCGG TTGGTAAAGC GACCCTTGGC CGTATCATGA 2 00 

ACGTATTGGG TGAACCAATC GACATGAAAG GTCCTATCGG TGAAGAAGAG 250 

CGTTGGGCAA TCCATCGCGA AGCGCCTTCT TATGAAGAGC TTGCTAGCTC 300 

ACAAGATCTG TTAGAAACCG GTATCAAGGT TATGGACCTG ATTTGTCCGT 350 

TTGCTAAGGG CGGTAAAGTC GGTCTGTTCG GTGGTGCGGG TGTAGGTAAA 400 

45 ACAGTAAACA TGATGGAGCT GATCCGTAAC ATCGCGATCG AGCACTCTGG 4 50 

GTATTCTGTA TTTGCCGGTG TAGGTGAGCG TACCCGTGAG GGTAATGACT 500 

TCTACCATGA AATGACTGAC TCCAACGTTT TGGACAAAGT ATCCTTGGTT 550 

TACGGCCAGA TGAATGAGCC ACCAGGTAAC CGTCTACGCG TTGCACTGAC 600 

CGGCCTGACC ATGGCGGAGA AATTCCGTGA TGAAGGTCGT GACGTACTGC 650 

50 TGTTTATCGA TAATATCTAT CGTTATACCC TAGCTGGTAC GGAAGTATCC 700 

GCATTGCTGG GTCGTATGCC ATCAGCGGTA GGTTATCAGC CAACACTGGC 750 

TGAAGAGATG GGTGTGTTGC AGGAACGTAT TACTTCCACT AAGACGGGTT 800 

CAATCACCTC TG 812 

55 

2) INFORMATION FOR SEQ ID NO: 762 

(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 4 08 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 x (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Burkholderia mallei 

(B) STRAIN: GB8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 762 



10 



CGACGTGATC GAGCCCTTCT TCGTCGACGT GATGCGCTCT TGCAGCTTGC 50 

CCATTTCTTC AGCCAGCGTC GGCTGATAGC CCACTGCCGA CGGCATACGG 100 

15 CCGAGCAGCG CCGACACTTC GGTACCGGCC AGCGTGAAAC GGTAGATGTT 150 

GTCGACGAAG AACAGCACGT CGAGGCCTTC GTCACGGAAG TGCTCGGCCA 200 

TCGTGAGGCC CGTCAGCGCC ACGCGCAGAC GGTTGCCCGG CGGCTCGTTC 250 

ATCTGGCCGT ACACCAGCGC GACCTTGTCG AGAACGTTCG AGTCCTTCAT 300 

TTCGTGGTAG AAGTCGTTCC CTTCACGGGT ACGCTCGCCC ACGCCCGCGA 350 

20 ACACGGAGTA ACCGCCGTGC TCCTTCGCGA TGTTGTTGAT GAGCTCCATC 400 

ATGTTGAC 408 



25 2) INFORMATION FOR SEQ ID NO: 763 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA ? 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium sordellii 

(B) STRAIN: ATCC 9714 



40 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 763 



GAACTTATAA ACAACATAGC TACTCAACAT GGTGGTATAT CAGTATTCGC 50 

AGGTGTTGGA GAGAGAACAA GAGAAGGTAA CGACCTTTAC GGAGAAATGA 100 

GTGAGTCTGG AGTTATAAAC AAGACAGCTC TAGTATTCGG ACAAATGAAT 150 

GAGCCACCTG GAGCAAGAAT GAGAGTTGCT TTAACTGGTC TTACAATGGC 200 

45 TGAATATTTC AGAGATCAAG AAGGACAAGA CGTTTTATTA TTCGTAGATA 250 

ATATATTCCG TTTCACTCAA GCAGGATCTG AGGTTTCTGC ACTTCTTGGA 300 

CGTACTCCAT CAGCAGTTGG ATACCAACCA ACATTAGCTA CAGAGATGGG 350 

TAGATTACAA GAGAGAATAA CATCTACAAA TAAAGGGTCT ATAACATCAG 400 



2) INFORMATION FOR SEQ ID NO: 764 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 4 05 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 

435 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



5 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium novyi 

(B) STRAIN: ATCC 19402 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 764 



TTAATTCAAG AATTAATCAA CAATATAGCG AAGGAACACG GTGGATTATC 50 

TGTATTTACA GGAGTTGGAG AAAGAACAAG AGAAGGTAAT GACCTTTACT 100 

10 ATGAAATGAA AGAATCTGGT GTTATAAATA AAACAGCACT AGTATTTGGT 150 

CAAATGAATG AGCCACCTGG AGCAAGAATG AGAGTTGCTC TTACAGGACT 200 

TACTATGGCA GAATATTTCA GGGATCAAGG ACAAAACGTA CTTTTATTCA 250 

TAGACAACAT ATTTAGATTC ACTCAAGCAG GTTCAGAGGT GTCAGCTTTA 300 

CTTGGAAGAA TACCTAGTGC CGTTGGATAC CAACCAACAC TAGCAACAGA 350 

15 AATGGGTGCG CTTCAAGAAA GAATAACATC TACAAAGCAT GGTTCTATAA 400 

CATCA 405 



20 2) INFORMATION FOR SEQ ID NO: 765 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 393 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM:. Clostridium botulinum 

(B) STRAIN: 20:3.1 



35 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 765 



AATTAATAAA TAACATAGCT AAAGAACATG GTGGATTATC AGTGTTTACT 50 

GGAGTTGGAG AAAGATCTAG AGAAGGTAAT GATCTTTATC ATGAAATGAG 100 

AGAATCAGGC GTTATAGATA AAACCGCATT AGTATTTGGT CAAATGAATG 150 

AGCCACCTGG TGCTAGAATG AGAGTTGCAT TAACAGGGTT AACTATGGCT 2 00 

40 GAATATTTTA GAGATAAAGG TCAAGATGTA TTACTATTTA TAGATAATAT 250 

ATTTAGATTT ACTCAAGCTG GATCAGAAGT TTCAGCATTA CTTGGAAGAA 300 

TACCATCAGC AGTTGGTTAC CAACCAACTT TAGCAACTGA AATGGGTGCA 350 

CTTCAAGAGA GAATTACATC AACTAAGAAT GGTTCTATAA CAT 393 



2) INFORMATION FOR SEQ ID NO: 766 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 3 99 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium histolyticum 

(B) STRAIN: ATCC 19401 

60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 766 

AGGAACTTAT AAATAATATT GCAAAACAAT ATGGAGGTCT ATCTGTATTT 50 

ACAGGTGTTG GTGAAAGAAC AAGAGAAGGT AATGACCTAT ATAATGAAAT 100 

5 GAAAGATTCT GGGGTTATAG AGAAAACTGC ACTAGTATTT GGACAGATGA 150 

ATGAACCACC AGGAGCGAGA ATGAGAGTAG CATTGACAGG ACTTACTATG 200 

GCAGAATATT TTAGAGATCA AGGGCAAGAT GTACTTTTAT TTATAGATAA 250 

TATATTTAGA TTTACGCAGG CTGGTTCTGA AGTTTCTGCA TTGTTAGGAA 300 

GAATTCCAAG TGCAGTTGGA TATCAACCAA CCCTTGCAAC TGAAATGGGT 350 

10 GCATTACAAG AAAGAATAAC ATCCACAAAA AATGGATCAA TTACTTCAG 399 



2) INFORMATION FOR SEQ ID NO: 767 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 390 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
2 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Peptostreptococcus prevotii 

(B) STRAIN: ATCC 9321 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 767 

30 CATGATGGAA CTGATCCGTA ACATCGCCAT CGAGCACAGC GGTTATTCCG 50 

TGTTCGCCGG TGTGGGTGAG CGTACTCGTG AGGGTAACGA CTTCTACCAC 100 

GAGATGAAGG ACTCCAACGT TCTGGACAAA GTGGCACTGG TCTACGGTCA 150 

GATGAACGAG CCGCCGGGTA ACCGTCTGCG CGTAGCACTG ACTGGCCTGA 200 

CCATGGCCGA GAAGTTCCGT GACGAAGGTA ACGACGTTCT GTTGTTCGTC 2 50 

35 GACAACATCT ACCGTTACAC CCTGGCCGGT ACTGAAGTAT CCGCACTGCT 3 00 

GGGCCGTATG CCTTCGGCAG TAGGTTACCA GCCGACCCTG GCTGAAGAGA 3 50 

TGGGCGTTCT GCAAGAACGT ATCACTTCGA CCAAGGAAGG 3 90 



40 



50 



2) INFORMATION FOR SEQ ID NO: 768 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 953 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Absidia corymbifera 

(B) STRAIN: ATCC 46775 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 768 

AGGTCTTGTT CGTGGCAAAA GGTCATTGAC ACTGGTGCTC CTATCACCAT 50 

TCCTGTTGGT AACGAAGTCC TTGGTCGTAT CATTAACGTC ATTGGTGAGC 100 

CCATTGATGA GCGTGGTCCY ATCAAGTCCA AGGCCACTCG TGCTATCCAC 150 

60 GCTGATGCTC CCGAGTTCGT TGATCAATCC CCCACTCCCG AGATTCTCGA 2 00 
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GACTGGTATC AAGGTTGTCG ATTTGCTTGC 
AGATTGGTCT TTTCGGTGGT GCCGGTGTCG 
GAATTGATCA ACAACATTGC CAAGGCTCAC 
TGGTGTCGGT GAACGTACTC GTGAAGGCAA 
5 TTTCCACTGG TGTCATCAAG CTTGAAGGTG 
TTTGGTCAAA TGAACGAACC CCCCGGAGCT 
TGGTTTGACC ATTGCCGAAT ACTTCCGTGA 
TGCTCTTCAT TGACAACATT TTCCGTTTCA 
TCCGCTTTGC TTGGTCGTAT TCCCTCTGCT 

10 YTCCACTGAT ATGGGTGGTA TGCAAGAGCG 
GTTCCATTAC CTCCGTGCAA GCTGTCTACG 
GATCCTGCTC CTGCTACTAC TTTTGCTCAC 
GTCTCGTTCC ATTGCTGAGT TGGGTATCTA 
ACTCCAAGTC CCGTATCTTG GATCCCCGTA 

15 TCTGTTGCCA CTGGTGTCCA ACAAATCCTT 
AGA 



PCT/CAOO/01150 

TCCCTATGCT CGTGGtGGTA 25/5 

GCAAGACTGT CTTGATTCAA 300 

GGTGGTTACT CTATCTTCTG 350 

CGATTTGTAC CACGAAATGA 400 

AATCCAAGTG TGCTCTTGTC 450 

CGTGCCCGTG TTGCCTTGAC 500 

TGAGGAAGGT CAAGATGTGT 550 

CTCAAGCCGG TTCTGAAGTG 600 

GTCGGTTACC AACCCACTCT 650 

TATTACTACC ACCAAGAACG 700 

TCCCTGCTGA CGATTTGACT 750 

TTGGACGCCA CCACTGTGTT 800 

CCCTGCTGTC GATCCCCTTG 850 

TCGTTGGTGA TGAGCACTAC 900 

CAAAACTACA AGTCGCTTCA 950 

953 



20 2 ) INFORMATION FOR SEQ ID NO: 769 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1343 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alternaria alternata 

(B) STRAIN: ATCC 62099 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 769 



35 



CGCGGTTCCA AGGCCACCGA CACCGGTGCC CCCATCAAGA TTCCCGTTGG 50 

TCACGGTACC CTTGGTCGTA TCATGAACGT CACTGGTGAC CCCATTGACG 100 

AGCGTGGTCC CATCAAGGCC ACCAAGTACG CTCCCATCCA CGCCGACCCC 150 

CCGGAGTTCA CCGAGCAATC CACCTCCGCT GAGGTCCTCG TTACCGGTAT 200 

40 CAAGGTTGTC GACCTGTTGG CTCCTTACGC TCGTGGTGGA AAGATTGGTC 250 

TCTTCGGAGG TGCTGGTGTC GGAAAGACTG TCTTCATTCA GGAGCTGATT 300 

GTAAGGAGAC ACACTGTCTA CTGGCTGAGC ATTAGCTAAC GGCAGGCAGA 350 

ACAACATCGC CAAGGCCCAC GGTGGTTTCT CTGTCTTCAC TGGTGTCGGT 400 

GAGCGTACCC GTGAGGGTAA CGATCTGTAC CACGAGATGC AGGAGACTTC 450 

45 CGTCATTCAG CTTGACGGTG ACTCCAAGGT CGCCCTCGTC TTCGGTCAGA 500 

TGAACGAGCC CCCGGGTGCC CGTGCCCGTG TCGCTCTTAC TGGTCTTACT 550 

GTTGCTGAGT AAGTCTTGAA TTCACTGTGT TGACAACGTC GTGGCTAATG 600 

GGAAAAAAGA TACTTCCGTG ACGAAGAGGG ACAGGATGTG CTTCTCTTCA 650 

TCGACAGTAA GTGCTTGTAC GAACTGCCTG TGAGACATAC ACTGACTTCG 700 

50 GCAATAGACA TTTTCCGTTT CACCCAGGCC GGTTCCGAGG TGTCCGCTCT 750 

TCTTGGTCGT ATCCCCTCTG CCGTCGGTTA CCAGCCCACC CTCGCCATTG 80 0 

ACATGGGTGT CATGCAGGAG CGTATTACCA CCACCACCAA GGGTTCCATC 850 

ACCTCCGTCC AGGCCGTCTA CGTGCCCGCT GACGATTTGA CTGACCCTGC 900 

CCCCGCCACC ACCTTCGCCC ATTTGGACGC CACCACTGTC TTGTCCCGTG 950 

55 GTATCTCCGA GTTGGGTATC TACCCCGCCG TCGACCCTCT TGACTCCAAG 1000 

TCCCGTATGT TGGACCCCCG TGTCATTGGT CAGGACCACT ACGACACCGC 1050 

CACCCGCGTT CAGCAGATTC TCCAGGAGTA CAAGTCGCTC CAGGATATCA 1100 

TTGCCATTCT CGGTATGGAC GAGTTGTCGG AAGCTGACAA GCTTACCGTC 1150 

GAGCGTGCCC GTAAGATCCA GCGTTTCTTG AGCCAGCCTT TCGCTGTCGC 1200 

60 CCAGGTCTTC ACTGGTATTG AGGGCAAGCT TGTCGACCTC AAGGACACCA 1250 

438 
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TCCGATCATT CAAGGCTATC TTGACTGGTG AGGGTGACGA CCTTCCCGAG 13 00 
GGTGAGTCTC GACTATCTCC GCATTCATAG CGTATAACTG ACA 1343 



2 ) INFORMATION FOR SEQ ID NO: 770 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 480 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus flavus 

(B) STRAIN: ATCC 26947 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 770 

TTCAGGAGTT GATTGTATGT TCACCTGCAA CATAAGACTT CCCATTCTCC 50 

ACTCTTTTCT AACTCTTCAC AGAACAACAT TGCCAAGGCT CACGGTGGTT 100 

ACTCTGTCTT CACTGGTGTC GGTGAGCGTA CCCGTGAGGG TAACGATCTG 150 

25 TAC CACGAAA TGCAGGAGAC TGGTGTCATT CAGCTCGAGG GTGAAT CTAA 2 00 

GGTCGCCCTT GTCTTCGGTC AGATGAACGA GCCCCCAGGT GCCCGTGCCC 250 

GTGTCGCCCT TACCGGTCTG ACCATCGCCG AGTACTTCCG TGACGAGGAA 3 00 

GGTCAGGATG TGCTGCTCTT CATTGACAAC ATTTTCCGTT TCACCCAGGC' 350 

CGGTTCTGAG GTGTCTGCCC TTCTTGGTCG TATCCCCTCC GCTGTCGGTT 400 

30 ACCAGCCCAC TCTGGCCGTC GACATGGGTG GTATGCAGGA GCGTATTACC 450 

ACCACCACCA AGGGTTCCAT TACCTCCGTC ' 480 



35 2) INFORMATION FOR SEQ ID NO: 771 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1174 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mucor circinelloides 

(B) STRAIN: ATCC 38592 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 771 



50 



CTCGAACAAG AYAACTTGCC YGCCATTTTG AACGCCCTTG AAGTCAAGGA 50 

YCACTCTGGT GGACGTCTCG TYCTCGAAGT CTCTCAACAT TTGGGTGAGA 100 

ACACTGTCCG TACTATTGCT ATGGATGGTA CTGAAGGTAA GTTATGTYCA 150 

TCCCANNGGA TACAGTCARA CAGMAATGTC TAGTGGTTAT AGCAGYAGCA 200 

55 GATGATTGAC CAATATGTTA GGTCTTGTCC GGTGGTCAAA AGGTTGTTGA 250 

CACTGGTGCT CCCATTACCA TCCCCGTTGG TAAGGAAGTC CTTGGTCGTA 300 

TCATCAACGT TATTGGTGAA CCCATTGATG AACGTGGTCC CATTGACGCC 350 

AAGACTCACC GTCCTATTCA CGCTGAAGCT CCCGAATTCG TTGATCAATC 400 

CCCCACTCCC GAAATCCTCG AGACTGGTAT CAAGGTYGTC GATTTGTTGG 450 

60 CTCCTTATGC TCGTGGTGGT AAGATTGGTC TCTTCGGTGG TGCTGGTGTC 500 
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GGTAAGACTG TCTTGATTCA AGAATTGATT AACAACATYG CCAAGGCTCA 550 

CGGTGGTTAC TCTATCTTCT GTGGTGTCGG TGAACGTACT CGTGAGGGTA 600 

ACGATTTGTA CCATGAAATG ATTGAAACCG GTGTCATTCA ATTGGAAGGC 650 

GAGTCCAAGT GTGCTCTCGT CTTTGGTCAA ATGAACGAAC CCCCAGGTGC 700 

5 TCGTGCCCGT GTCGCTTTGA CTGGTTTGAC TATTGCTGAA TACTTCCGTG 750 

ATGATGAGGG TCAAGATGTC TTGCTTTTCA TTGATAACAT TTTCAGATTC 800 

ACTCAAGCTG GTTCTGAGGT ATCTGCCCTT TTGGGTCGTA TTCCTTCCGC 850 

TGTCGGTTAC CAACCCACTC TTTCCACYGA TATGGGTGGT ATGCAAGAGC 900 

GTATTACTAC CACCAAGAAC GGTTCCATTA CCTCCGTCCA AGCTGTCTAC 950 

10 GTCCCTGCTG ATGATTTGAC CGATCCTGCT CCTGCCACCA CTTTTGCTCA 1000 

CTTGGATGCC ACCACTGTCT TGTCTCGTTC CATCGCTGAA TTGGGTATCT 1050 

AYCCCGCTGT CGATCCTCTT GATTCCAAGT CTCGTATCCT CGATCCCCGT 1100 

ATTGTCGGTG ATGAGCACTA CAAGGTTGCC ACTGAAGTTC AACAAATCCT 1150 

CCAAAACTAC AAGTCTCTCC AAGA 1174 

15 



2) INFORMATION FOR SEQ ID NO: 772 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 467 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Piedraia hortai 
30 (B) STRAIN: ATCC 24292 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 772 

AGGAGCTTAT CAACAACATC GCCAAGGCTC ACGGTGGTTA CTCCGTCTTC 50 

35 ACTGGTGTCG GTGAGCGTAC TCGTGAGGGT AACGATTTGT ACCACGAGAT 100 

GCAAGAGACT TCCGTCATTC AGCTTGACSG CGAGTCCAAG GTCGCTCTCG 150 

TGTTCGGTCA GATGAACGAG CCCCCGGGTG CCCGTGCCCG TGTTGCCCTG 200 

ACTGGTCTTA CCATCGCTGA GTACTTCCGT GATGCCGAGG GTCAGGATGG 250 

TAAGTTCTAT AACTCTTGTC GCAAAGGTTT CATTCTGGTC GCTAACTTGC 3 00 

40 TCAGTGCTCC TGTTCATCGA CAACATTTTC CGTTTCACCC AGGCTGGTAT 350 

GGAGGTGTCT GCCCTCCTCG GTCGTATTCC TTCTGCCGTC GGTTACCAAC 4 00 

CTACTCTCGC CGTCGACATG GGTGGTATGC AAGAGCGTAT TACCACTACC 450 

AAGAAGGGAT CCATTAC 467 



2) INFORMATION FOR SEQ ID NO: 773 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 578 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudallescheria boydii 

(B) STRAIN: ATCC 44331 

60 
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(xi) SEQUENCE DESCR! 

TTGCCAAGGC CCACGGTGGT 
ACCCGTGAGG GTAACGATCT 
5 TCAGCTTGAT GGCGAGTCCA 
AGCCCCCTGG TGCCCGTGCT 
GAGTACTTCC GTGATGAGGA 
TTATCTTCTT TGCCACCACC 
GGCATCGTTT GTAGGGCGGG 
10 TTGAAGCCGC GGATGCCCGA 
GACAACAGTG CTTCTCTTCA 
GTTCCGAGGT GTCTGCCCTT 
CAGCCCACTC TTGCCGTAGA 
CACCAAGAAG GGTTCGATTA 

15 



PTION: SEQ ID NO: 773 

TACTCTGTCT TCACTGGTGT 
GTACCACGAA ATGCAGGAAA 
AGGTCGCGCT TGTCTTCGGT 
CGTGTCGCTC TTACTGGTCT 
GGGTCAGGAT GGTAAGTTAT 
CCTCTACGAA TCCATGCCTC 
TCGGAGTTTG CGGCAATTTC 
TGTTTGACGC GTATCGATGC 
TTGACAACAT TTTCCGATTC 
CTCGGTCGTA TTCCCTCTGC 
CATGGGTGCC ATGCAGGAAC 
CCTCCGTC 



CGGTGAGCGT 5 0 

CCTCCGTCAT 100 

CAGATGAACG 150 

TACCGTTGCC 200 

ATCGTTTTTA 250 

CGTTGGTGAA 300 

TGCCGTCGGC 350 

TAACAACAAT 400 

ACCCAGGCCG 4 50 

CGTCGGTTAC 500 

GTATTACCAC 550 

578 



2) INFORMATION FOR SEQ ID NO: 774 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1123 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rhizopus oryzae 
30 (B)„ STRAIN: ATCC 56015 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 774 

AACTTACCYG CTATCTTGAA CGCTCTCGAA GTCCAAGATC ACTCTGGTGG 50 

35 ACGTCTTGTC CTTGAAGTTC GCTCAACACT TGGGTGAAAA TACTGTCCGT 100 

ACTATTGCTA TGGATGGTAC TGAAGGTAAG CTATACTATA ACCGTKTTAT 150 

CCGAGTATGA TATTAAC TTG AAAAAAGGTC TCGTYCGTGG TCAAAAGGTT 200 

ATTGACACTG GTGCTCCCAT TACCATTCCT GYTGGTAAGG AAGTTCTCGG 250 

TCGTATCATT AACGTCATTG GTGAACCCAT CGATGAACGT GGTCCTATCA 300 

40 ACGCCAAGAG CCAACGTCCC ATTCACGCCG AAGCTCCCGA ATTCGTTGAC 350 

CAATCTCCTA CTCCCGAAAT TCTTGAAACT GGTATCAAGG TTGTCGACTT 400 

GTTGGCTCCT TATGCTCGTG GTGGTAAGAT TGGTCTTTTC GGTGGTGCTG 450 

GTGTCGGTAA GACTGTGTTG ATTCAAGAAT TGATTAACAA CATCGCCAAG 500 

GCTCACGGTG GTTACTCTAT TTTCTGTGGT GTCGGTGAAC GTACTCGTGA 550 

45 AGGTAACGAT CTTTACCACG AAATGATTGA AACTGGTGTC ATCAAGCTCG 600 

ATGGTGACTC CAAGTGTGCT CTTGTCTTTG GTCAAATGAA CGAACCCCCA 650 

GGAGCTCGTG CCCGTGTTGC CTTGACTGGT TTGACCATTG CTGAATACTT 700 

CCGTGATGAT GAAGGTCAAG ATGTGTTGCT TTTCATTGAT AACATTTTCC 750 

GTTTCACCCA AGCTGGTTCW GAAGTATCTG CCCTTTTGGG TCGTATTCCC 800 

50 TCCGCTGTCG GTTACCAACC CACTCTTTCT ACTGATATGG GTGGTATGCA 850 

AGAACGTATT ACAACCACCA AGAACGGTTC CATTACATCC GTCCAAGCTG 900 

TCTACGTCCC TGCTGATGAT TTGACCGATC CTGCTCCCGC CACCACTTTT 950 

GCTCACTTGG ATGCCACCAC TGTCTTGTCT CGTTCCATTG CCGAATTGGG 1000 

TATTTACCCT GCCGTCGATC CTCTTGAYTC CAAGTCTCGT ATCTTGGATC 1050 

55 CTCGTATCGT TGGTGACGAA CACTACAAGG TCGCTACCGA AGTTCAACAA 1100 

ATCCTTCAAA ACTACAAGTC TCT 1123 



60 2) INFORMATION FOR SEQ ID NO: 775 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Scopulariopsis koningii 

(B) STRAIN: ATCC 38745 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 775 

ATTCAGGAGC TCATCAACAA CATTGCCAAG GCTCACGGTG GTTACTCTGT 50 

GTTCACTGGT GTCGGTGAGC GTACCCGTGA GGGTAACGAT CTGTACCACG 100 

AAATGCAGGA GACTTCGGTC ATTCAGCTCG AGGGCGAGTC CAAGGTCGCG 150 

CTTGTGTTCG GTCAGATGAA CGAGCCCCCC GGTGCCCGTG CCCGTGTCGC 2 00 

CCTTACCGGT CTGACCGTTG CCGAGTACTT CCGTGACGAG GAGGGCCAGG 250 

ATGGTGAGTA ACCGACGAAG TCTGAGATCT TGTCGGGCAT TATTCTAACG 300 

ACAACTAGTG CTTCTCTTCA TCGACAACAT TTTCCGCTTC ACCCAGGCCG 350 

GTTCCGAGGT GTCCGCGCTT CTCGGCCGTA TCCCCTCTGC CGTCGGTTAC 400 

CAGCCCACCC TGGCCGTCGA CATGGGAGGT ATGCAGGAGC GTATTACCAC 450 

GACTCAGAAG GGCTCGATTA CCTCGGT 477 



2 ) INFORMATION FOR SEQ ID NO: 776 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 610 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: ATCC 8125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 776 

TCCGGAGTTG ATTGTAAGTC ATTTGAAACC CAGCCCCAAG AAACAGAAGC 50 

TAGGCGAAAA TTGGACAATT GAGCAATTTA GCCATTGGAG AAAAGAAATT 100 

TCGAGTATTA ATTGTTTTTA TAGAACAACA TTGCCAAGGC TCACGGTGGT 150 

TACTCTGTCT TCACTGGTGT CGGAGAGCGT ACCCGTGAAG GAAACGATCT 200 

CTACCATGAA ATGCAGGAGA CCCGTGTCAT TCAGCTTGAT GGCGAGTCCA 250 

AGGTCGCCCT GGTCTTCGGC CAGATGAACG AGCCCCCAGG TGCCCGTGCC 300 

CGTGTTGCTC TTACTGGTTT GACCATTGCT GAGTACTTCC GTGATGAGGA 350 

AGGTCAAGAC GGTGAGTTTY TTATGGATAA AARATTTTTT TTTTTTTTTT 400 

TTTTTTMAAR AAATTCATGT TCTAACAAAG TATATCCTAG TGCTTCTCTT 450 

CATCGACAAC ATTTTCCGTT TCACTCAGGC TGGTTCCGAA GTGTCTGCCC 500 

TGCTTGGTCG TATCCCATCT GCCGTCGGTT ACCAACCCAC TCTTGCCGTC 550 

GACATGGGTG GTATGCAGGA ACGTATTACC ACCACCAAGA AGGGATCCAT 600 

TACCTCCGTC 610 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton tonsurans 

(B) STRAIN: ATCC 56185 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 777 

GGAGTTGATT GTAAGTCATT TGAAACCCAG CCCCAAGAAA CAGAAGCTAG 50 

GTGAAAATTG GACAATTGAA CAATTTAGCC CTTGGAGAAA AGAAATTTCG 100 

AGTATTAATT ATTTTTATAG AACAACATTG CCAAGGCTCA CGGTGGTTAC 150 

TCTGTCTTCA CTGGTGTCGG AGAGCGTACC CGTGAAGGAA ACGATCTCTA 2 00 

CCATGAAATG CAGGAGACCC GTGTCATTCA GCTTGATGGC GAGTCCAAGG 2 50 

TCGCCCTGGT CTTCGGCCAG ATGAACGAGC CCCCAGGTGC CCGTGCCCGT 3 00 

GTTGCTCTTA CTGGTTTGAC CATTGCTGAG TACTTCCGTG ATGAGGAAGG 350 

TCAAGACGGT GAGTTTCTTA TGGATGAAAG ATTTTTTTTT TTCAAGAAAT 400 

TCATGTTCTA ACAAAGTGTA TTCTAGTGCT TCTCTTCATC GACAACATTT 450 

TCCGTTTCAC TCAGGCTGGT TCCGAAGTGT CTGCCCTGCT TGGTCGTATC 500 

CCATCTGCCG TCGGTTACCA ACCCACTCTT GCCGTCGACA TGGGTGGTAT 550 

GCAGGAACGT ATTACCACCA CCAAGAAGGG ATCCATTACC TCC 593 



2) INFORMATION FOR SEQ ID NO: 778 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1141 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichosporon cutaneum 

(B) STRAIN: ATCC 62965 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 778 

CCGTGGTCAA GAAGTTATTG ACACTGGTGC CCCAATTACC ATTCCTGTTG 50 

GTCGTGGTAC TCTTGGTAGA ATTATCAACG TCATTGGTGA AC CAATTGAC 100 

GAACGTGGCC CTATCAAGGC TTCTAAGTAT GCTCCTATCC ATACTGAACC 150 

ACCAACCTTT GCTGAACAAT CTACTTCTGC TGAAGTTCTT GAAACCGGTA 200 

TCAAGGTTGT CGATCTTCTT GCTCCTTACG CCCGTGGTGG TAAGATTGGT 250 

CTTTTCGGTG GTGCTGGTGT CGGTAAGACT GTCTTCATTC AAGAACTTAT 300 

TAACAACATT GCCAAGGCTC ACGGTGGTTT CTCTGTCTTC ACTGGTGTCG 350 

GTGAAAGAAC CCGTGAAGGT AACGATCTTT ACCGTGAAAT GAAGGAAACT 400 

GGTGTCATCA ACCTCGAAGG TGACTCCAAG GTCGCTCTCG TTTTCGGTCA 450 

AATGAACGAA CCTCCAGGTG CCCGTGCCCG TGTCGCTTTG ACTGGTCTTA 500 

CCATTGCCGA ATACTTCCGT GATGAAGAAG GACAAGATGT CTTGCTTTTC 550 

GTTGACAACA TTTTCAGATT CACCCAAGCC GGTTCTGAAG TCTCTGCTCT 600 

TTTGGGTCGT ATTCCATCTG CCGTCGGTTA CCAACCTACC CTTGCTACCG 650 
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ATATGGGTGC CCTCCAAGAA CGTATTACCA 
ACATCTGTCC AAGCCGTCTA TGTCCCAGCA 
CCCAGCCACC ACTTTCGCTC ACTTGGACGC 
CCATTTCCGA ATTGGGTATC TACCCAGCTG 
5 TCTCGTCTTT TGGATCCTGA AGTTATTGGA 
CACTCAAGTT CAACAAACCC TCCAAGCTTA 
TTGCCATTTT GGGTATGGAT GAATTGTCTG 
GAACGTGCCC GTAAGATCCA AAGATTCCTT 
CGAAGTTTTC ACTGGTATCG AAGGTCGTCT 
10 TCAGATCTTT CAAGGAAATC CTTGAAGGTA 



PCT/CA00/01150 
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2) INFORMATION FOR SEQ ID NO: 779 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1093 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Cladophialophora carrionii 

(B) STRAIN: ATCC 22864 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 779 

30 GCTGAAGGCC GAACGTGAGC GTGGTATCAC CATCGATATC GCGCTCTGGA 50 

AGTTCGAGAC TCCCAAGTAC TTCGTCACCG TCATCGATGC CCCTGGTCAT 100 

CGTGACTTCA TCAAGAACAT GATCACTGGT ACCTCCCAGG CTGATTGTGC 150 

TATTCTCATC ATTGCCGCTG GTACTGGTGA GTTCGAGGCC GGTATCTCCA 200 

AGGATGGCCA GACCCGTGAG CATGCTCTGC TCGCCTACAC CCTGGGCGTG 250 

35 AAGCAGCTTA TCGTCGCCAT CAACAAGATG GACACCACCA AATGGTCTGA 3 00 

GGATCGTTTC AACGAAATCA TCAAGGAGAC TTCCAACTTC ATCAAGAAGG 350 

TCGGATACAA CCCCAAGTCC GTTCCATTCG TGCCCATCTC CGGCTTCAAC 400 

GGTGACAACA TGATCGACGT CTCCACCAAT GCCCCCTGGT ACAAGGGCTG 450 

GGAAAAGGAG TCCAAGGCTG GCAAGGCCAC CGGCAAGACC CTCCTTGAGG 500 

40 CTATCGACTC CATCGACCCT CCTGCTCGTC CCACCGACAA GCCTCTCCGT 550 

CTCCCACTCC AGGATGTCTA CAAGATTTCT GGTATCGGCA CGGTGCCCGT 600 

CGGTCGTGTT GAGACTGGTA CCATCAAGGC CGGTATGGTC GTCACCTTTG 650 

CCCCCGCCAA CGTCACCACT GAAGTCAAGT CCGTCGAAAT GCACCACGAA 700 

CAGCTCGCCG AGGGCGTTCC GGGTGACAAC GTCGGCTTCA ACGTCAAGAA 750 

45 CGTCTCCGTG AAGGAGGTTC GTCGTGGAAA CGTTGCTGGT GACTCCAAGA 800 

ACGACCCCCC CAAGGGTGCC GACTCCTTCA ACGCCCAGGT CATCGTCCTC 850 

AACCACCCTG GTCAGGTCGG TGCTGGCTAC GCCCCGGTCT TGGATTGCCA 900 

CACTGCCCAC ATTGCCTGCA AGTTCTCTGA GCTCCTCGAG AAGATCGATC 950 

GTCGTACCGG CAAGTCCATG GAAAACAACC CCAAGTTCAT CAAGTCTGGT 1000 

50 GATGCTGCCA TCGTGAAGAT GGTTCCCAGC AAGCCTATGT GCGTTGAGGC 1050 

TTTCACCGAC TACCCTCCTC TTGGTCGTTT CGCCGTCCGT GAC 1093 



55 2) INFORMATION FOR SEQ ID NO: 780 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 752 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 
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WO 01/23604 PCT/CAOO/01150 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cunninghamella bertholletiae 

(B) STRAIN: ATCC 42115 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 780 



TACTTGTAAA TGGTCTCAAG ATCGTTACAA CGAAATTGTT AAGGAAGTTT 50 

CTTCCTTCAT CAAGAAGATT GGTTACAATC CTAAATCCGT TCCTTTCGTY 100 

CCTATCTCTG GTTGGCACGG TGATAACATG TTGGAAGCTT CTACCAACAT 150 

GCCTTGGTAC AAGGGATGGA CCAAGGAAAC TAAAGCTGGT TCTTCCACTG 200 

15 GTAAGACTCT CTTAGAAGCC ATTGACAGCA TTGAACCTCC TACCCGTCCT 250 

TCTGACAAGC CTTTACGTCT TCCTTTACAA GATGTTTACA AGATTGGTGG 300 

TATTGGTACT GTCCCTGTTG GTCGTGTTGA AACTGGTGTC ATCAAGGCTG 350 

GTATGGTTGT TACYTTCGCT CCCGCTAACG TCACCACTGA AGTTAAGTCC 400 

GTTGAAATGC ATCACGAACA ATTAGAACAA GGTGTTCCTG GTGACAACGT 450 

20 TGGTTTCAAC GTCAAGAACG TTTCCGTTAA GGATATCCGT CGTGGTAACG 500 

TCTGTTCCGA CTCCAAGAAC GACCCCGCTA AGGAATCTGC TTCCTTCAAC 550 

GCTCAAGTTA TCGTCTTGAA CCACCCTGGT CAAATTGGTG CTGGTTATGC 600 

CCCAGTTCTT GACTGTCACA CTGCTCACAT TGCTTGTAAG TTCGCTGAAT 650 

TATTAGAAAA GATCGATCGT CGTTCCGGTA AGAAACTCGA AGATGCTCCT 700 

25 AAATTCGTTA AATCTGGTGA CTCTGCTATC GTTAAGATGG TTCCTTCCAA 750 

GC 752 



30 2) INFORMATION FOR SEQ ID NO : 781 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 8 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Curvularia lunata 

(B) STRAIN: ATCC 26425 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 781 



CAAGTGGTCT GAGGACCGTT ACCAGGAAAT CATCAAGGAG ACCTCCAACT 50 

TCATCAAGAA GGTCGGCTAC AACCCCAAGC ACGTTCCCTT CGTCCCCATC 100 

TCCGGTTTCA ACGGAGACAA CATGATTGAG GCCTCCACCA ACTGCCCCTG 150 

GTACAAGGGT TGGGAGAAGG AGACCAAGGC CAAGGCCACT GGTAAGACCC 200 

50 TCCTTGAGGC CATCGACGCC ATCGACCCTC CTGTCCGTCC TACCGACAAG 250 

CCCCTCCGCC TTCCCCTCCA GGATGTTTAC AAGATTGGTG GTATTGGCAC 3 00 

GGTCCCCGTC GGTCGTGTCG AGACCGGTAT CATCAAGCCC GGTATGGTCG 350 

TCACCTTCGC CCCCGCTGGT GTCACCACCG AAGTCAAGTC CGTCGAGATG 4 00 

CACCACGAGC AGCTTACTGA GGGTGTCCCC GGTGACAACG TCGGCTTCAA 450 

55 CGTCAAGAAC GTCTCCGTCA AGGAGATCCG TCGTGGTAAC GTTGCCGGTG 500 

ACTC CAAGAA CGACCCCCCC AAGGGTTGCG AGTCCTTCAA CGCCCAGGTC 550 

ATCGTCCTCA ACCACCCTGG TCAGGTCGGT GCCGGTTACG CCCCAGTCCT 600 

TGACTGCCAC ACTGCCCACA TTGCCTGCAA GTTCTCCGAG CTCCTCGAGA 650 

AGATCGACCG CCGTACCGGA AAGTCTGTTG AGAACTCCCC CAAGTTCATC 700 

60 AAGTCCGGTG ACGCCGCCAT CGTCAAGA 728 
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2) INFORMATION FOR SEQ ID NO: 782 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1145 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
10 (D) TOPOLOGY: Linear 

r 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Fonsecaea pedrosoi 

(B) STRAIN: ATCC 18831 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 782 

20 TGAAGTCCGA GCGTGAGCGT GGTATCACCA TCGATATCGC CCTCTGGAAG 50 

TTCGAGACTC CCAAGTAAGG CTCAACAGAC ACAACAAGCA AATGCATACT 100 

CGCTAACCTA TTCACCCACC ACAGGTACAA CGTCACCGTC ATTGACGCCC 150 

CCGGTCACCG TGATTTCATC AAGAACATGA TCACTGGTAC CTCCCAGGCT 200 

GACTGCGCCA TTCTCATCAT TGCCGCCGGT ACTGGTGAGT TCGAGGCCGG 250 

25 TATCTCCAAG GACGGTCAGA CCCGTGAGCA CGCTCTTCTC GCCTACACCC 3 00 

TTGGTGTCAA GCAGCTCATC GTTGCCATCA ACAAGATGGA CACCACCAAG 350 

TGGTCTGAGG CCCGTTACCA GGAGATCATC AAGGAGACCT CCGGTTTCAT 4 00 

CAAGAAGGTC GGCTTCAACC CCAAGCACGT TCCCTTCGTG CCCATCTCCG 450 

GTTTCAACGG TGACAACATG ATCGACGTCT CCACCAACTG CCCCTGGTAC 500 

30 AAGGGTTGGG AGAAGGAGAC CAAGGCCAAG GCCACCGGCA AGACCCTCCT 550 

CGAGGCCATT GACGCCATCG \ACCCCCCCAC TCGTCCCACC GACAAGCCCC 600 

TCCGTCTTCC CCTYCAGGAT GTCTACAAGA TCGGTGGTAT TGGCACGGTT . 650 

CCCGTCGGTC GTGTCGAGAC > CGGTACCATC AAGGCCGGCA TGGTCGTCAC 700 

CTTCGCCCCC GCTGGTGTCA CCACTGAGGT CAAGTCCGTC GAGATGCACC 750 

35 ACGAGCAGCT TCCCGAGGGT CTCCCCGGTG ACAACGTCGG CTTCAACGTC 800 

AAGAACGTCT CCGTCAAGGA GATCCGTCGT GGCAACGTCG CCGGTGACTC 850 

CAAGAACGAC CCCCCCAAGG GCTGCGACAG CTTCAACGCC CAGGTCATCG 900 

TCCTCAACCA CCCCGGTCAG GTCGGCGCCG GCTACGCGCC CGTCCTCGAC 950 

TGCCACACTG CTCACATTGC TTGCAAGTTC TCTGAGCTCC TCGAGAAGAT 1000 

40 CGACCGCCGT ACCGGCAAGT CCATTGAGGC CAGCCCCAAG TTCATCAAGT 1050 

CTGGTGACGC CGCCATCGTC AAGATGGTTC CCTCCAAGCC TATGTGCGTT 1100 

GAGGCCTTCA CCGACTACCC CCCTCTTGGA CGTTTCGCCG TCCGT 1145 



45 



55 



2) INFORMATION FOR SEQ ID NO: 783 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1151 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microsporum audouinii 

(B) STRAIN: ATCC 11347 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 783 
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GCTCAAGGCT GAGCGTGAGC GTGGTATCAC CATTGACATC GCCCTCTGGA St) 

AGTTCGAGAC CCCCAAGTAC ATGGTCACCG TCATCGGTAT GCTTTATCTG 100 

TTTCCCATTT ATAGTTGCGA CAAGTAACTA ATAAAAAGTA GATGCCCCCG 150 

5 GACACCGTGA CTTCATCAAG AACATGATTA CTGGTACCTC CCAGGCCGAC 200 

TGCGCTATTC TCATCATTGC TGCCGGTACT GGTGAGTTCG AGGCTGGTAT 250 

CTCCAAGGAT GGCCAGACTC GTGAGCACGC CCTGCTCGCT TTCACCCTCG 300 

GTGTCAAGCA GCTCATCGTT GCCATCAACA AGATGGACAC CACCAACTGG 350 

TCTGAGTCCC GTTTCGGTGA AATCATCAAG GAAGTCACCA ACTTCATCAA 400 

10 GAAGGTCGGC TACGACCCCA AGGGTGTCCC ATTCGTCCCA ATCTCTGGCT 450 

TCAACGGTGA CAACATGATT GAGCCCTCCA CCAACTGCCC ATGGTACAAG 500 

GGATGGAACA AGGAGACCAA GGCCGGTGGC AAATCCTCTG GTAAGACCCT 550 

CCTTGAGGCC ATCGATGCCA TTGACATGCC CACTCGTCCC ACCGACAAGC 600 

CTCTCCGTCT CCCACTCCAG GATGTCTACA AGATCTCTGG TATCGGAACA 650 

15 GTACCAGTCG GTCGTGTTGA GACTGGTATC ATCAAGCCTG GTATGGTTGT 700 

CACTTTCGCC CCCGCCAACG TCACCACTGA AGTCAAGTCC GTCGAAATGC 750 

ACCACCAGCA GCTCGTTCAG GGTGTTCCCG GTGACAACGT TGGCTTCAAC 800 

GTCAAGAACG TCTCTGTCAA GGAAGTCCGC CGTGGTAACG TTGCCGGTGA 850 

TTCCAAGAAC GACCCACCAT CTGGCTGCGC CTCTTTCAAG GCCCAGGTCA 900 

20 TCGTCCTCAA CCACCCCGGC CAGATCGGTG CTGGTTACGC CCCAGTCCTC 950 

GACTGCCACA CTGCCCACAT TGCTTGCAAG TTCTCTGAGC TTCTTGAGAA 1000 

GATTGACCGC CGTACTGGTA AATCCGTCGA AACCAGCCCT AAGTTCGTCA 1050 

AGTCTGGTGA TGCCGCTATT GCCACCATGG TTCCATCCAA GCCCATGTGC 1100 

GTTGAGGCTT TCACTGACTA CCCACCACTT GGTCGTTTCG CCGTCCGTGA 1150 

25 C 1151 



2) INFORMATION FOR SEQ ID NO: 784 



30 



(i) SEQUENCE CHARACTERISTICS:' 

(A) LENGTH: 979 bases 

(B) TYPE:' Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Mucor circinelloides 

(B) STRAIN: ATCC 38592 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 784 

45 CCAAGTGGTC TCAAGATCGT TACAACGAAA TTGTCAAGGA AGTTTCCGGT 50 

TTCATCAAGA AGATCGGTTT CAACCCCAAG TCCGTTCCTT TCGTTCCCAT 100 

TTCTGGCTGG CACGGTGATA ACATGTTGGA TGAATCCACC AACATGCCCT 150 

GGTTCAAGGG ATGGAACAAG GAGACCAAGG CCGGTTCCAA GACTGGTAAG 200 

ACTCTCCTCG AAGCCATCGA TGCCATTGAG CCCCCTGTCC GTCCTTCTGA 250 

50 CAAGCCTCTC CGTCTTCCTC TTCAAGATGT CTACAAGATT GGTGGTATTG 300 

GTACAGTTCC CGTCGGTCGT GTTGAAACTG GTACTATCAA GGCTGGTATG 350 

GTTGTCAACT TCGCTCCCGC TGCTGTCACC ACTGAAGTTA AGTCCGTTGA 400 

AATGCATCAC GAAACCCTCT CTGAAGGTCT CCCCGGTGAC AACGTTGGTT 450 

TCAACGTCAA GAACGTCTCC GTCAAGGATA TCCGTCGTGG TAACGTCTGT 500 

55 TCCGACTCCA AGAACGATCC CGCTAAGGAA TCTGCCTCTT TCACTGCTCA 550 

AGTTATTATC TTGAACCATC CCGGTCAAAT CTCTGCTGGT TACGCACCAG 600 

TTCTCGATTG TCACACTGCT CACATCGCCT GTAAGTTCTC TGAACTCATT 650 

GAGAAGATTG ATCGTCGTTC CGGTAAGTAC CTGCATCTGT CAGAATTGAA 700 

GGTCCGCCGT TATAGCAAAG GCTGGGTTTA AATGTTGGGG TTTGTCTGAT 750 

60 CTATAATGAT GATTGCTCCT TCAATTTTTG ACATAATTTG ATGATCTGAA 800 
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TTGTGTTGCT AACGTCGCAT TTGCTTCTTT TGCTTCTTTT GCATGTAGGT 
AAGAAGATGG AAGATGCTCC CAAAGTAAGT ATTACGATTG ATGGACAATT 
AAAATAGAAT ACTAACAATT ATTGTTTATA GTTCGTCAAG TCTGGTGACT 
CTGCTATCGT CAAGATGGTT CCCTCCAAG 



8Ett 



900 
950 
979 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 785 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1099 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Phialaphora verrucosa 

(B) STRAIN: ATCC 3 8561 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 785 

GGACAAGCTG AAGGCCGAAC GTGAGCGTGG TATCACCATC GATATCGCGC 50 

TCTGGAAGTT CGAGACTCCC AAATACTTCG TCACCGTCAT TGATGCCCCT 100 

GGTCATCGTG ACTTCATCAA GAACATGATC ACTGGTACCT CCCAGGCTGA 150 

TTGTGCCATC CTCATCATTG CCGCCGGTAC CGGTGAGTTC GAAGCCGGTA 200 

TCTCCAAGGA TGGCCAGACC CGTGAGCACG CTCTCCTCGC CTACACCCTA 250 

GGTGTGAAGC AGCTTATCGT CGCCATCAAC AAGATGGACA CCGCCAAATG 300 

GTCCGAGGAT CGGTTCAACG AAATCATCAA GGAGACTTCC AACTTCATCA 350 

AGAAGGTCGG ATACAACCCC AAGTCCGTCC CGTTCGTGCC CATCTCCGGT 400 

TTCAACGGTG ACAACATGAT CGACGTCTCC TCCAACGCCC CCTGGTACAA 450 

GGGTTGGGAG AAGGAGACCA AGGCCGGCAA GGCCACTGGC AAGACCCTCC 500 

TCGAGGCCAT CGACGCGATT GACCCTCCTA CTCGTCCCAC CGACAAGCCC 550 

CTCCGTCTCC CTCTCCAGGA TGTCTACAAG ATCTCTGGTA TCGGCACGGT 600 

GCCCGTTGGT CGTGTTGAGA CCGGTACCAT CAAAGCCGGT ATGGTCGTCA 650 

CCTTCGCTCC CGCCAACGTC ACCACTGAAG TCAAGTCCGT CGAAATGCAC 700 

CACGAACAGC TCGCCGAAGG TGTTCCAGGT GACAATGTCG GCTTCAACGT 750 

CAAGAACGTC TCCGTCAAGG AGGTTCGTCG TGGAAACGTT GCCGGTGACT 800 

CCAAGAATGA CCCCCCCAAG GGTGCCGACT CCTTCAACGC TCAGGTCATC 850 

GTCCTCAACC ACCCTGGTCA GGTCGGTGCC GGCTACGCCC CGGTCTTGGA 900 

TTGCCACACT GCCCACATTG CTTGCAAGTT CTCTGAGCTC CTCGAGAAGA 950 

TCGATCGTCG TACCGGCAAG TCCATGGAAA ACAACCCCAA GTTTATCAAG 1000 

TCTGGTGATG CTGCCATCGT GAAGATGGTT CCCAGCAAGC CTATGTGCGT 1050 

TGAGGCCTTC ACCGACTATC CTCCTCTTGG TCGTTTCGCC GTCCGTGAC 1099 



2) INFORMATION FOR SEQ ID NO: 786 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE : Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saksenaea vasifoirmis 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 786 

5 ACCACCAAGT GGTCTGAGGC CCGTTACCAG GAGATCATCA AGGAGACCTC 50 

CGGYTTCATC AAGAAGGTCG GCTTCAACCC CAAGCACGTT CCCTTCGTGC 100 

CCATCTCCGG TTTCAACGGT GACAACATGA TCGACGTCTC CACCAACTGC 150 

CCCTGGTACA AGGGTTGGGA GAAGGAGACC AAGGCCAAGG CCACCGGCAA 2 00 

GACCMTCCTC GAGGCCATTG ACGCCATCGA CCCCCCYAGY CGTCCCACCG 250 

10 ACAAGCCCCT YCGTCTTCCC CTMCAGGATG TYTACAAGAT TGGCGGTATT 300 

GGCACGGTTC CCGTCGGTCG TGTYGAGACC GGTRCCATCA AGGGTGGCAT 350 

GGTCGTCACC TTCCCCCCCG CTGGTGTCAC CACTGAGGTC AAGTCCGTCG 4 00 

AGATGCACCA CGAGCAGCTC GCCGAGGGTS TCCCCGGTGA CAACGTCGGC 450 

TTCAACGTCA AGAACGTCTC CGTCAAGGAG ATCCGTCGTG GCAACGTTGC 500 

15 CGGTGACTCC AAGAACGACC CCCCCAAGGG CTGCGACAGC TTCAACGCCC 550 

AGGTCATCGT CCTCAACCAC CCCGGTCAGG TCGGCGCCGG CTACGCGCCS 600 

GTCCTSGACT GCCACACTGC TCACATTGCC TGCAAGTTCT CTGAGCTCCT 650 

CGAGAAGATC GACCGCCGTT CCGGCAAGTC CATCGAGTCC GGCCCCAAGT 700 

TCATCAAGTC TGGTGACGCC GCCATCGTCA AGATGGTTCC CTCCAAGCCC 750 

20 



2) INFORMATION FOR SEQ ID NO: 787 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1084 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Syncephalastrum racemosum 
35 (B) STRAIN: ATCC 32330 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 787 

GCTCAAGGCC GAGCGTGAGC GTGGTATCAC CATCGATATC GCTCTCTGGA 50 

40 AGTTCGAGAC CCCCAAGTAC CACGTCACCG TCATTGATGC CCCCGGCCAT 100 

CGTGATTTCA TCAAGAACAT GATCACTGGT ACCTCGCAGG CTGACTGCGG 150 

TATCCTCATC ATTGCCGCCG GTACTGGTGA GTTCGAGGCT GGTATCTCCA 2 00 

AGGATGGCCA GACCCGTGAG CACGCTCTGC TTGCCTTCAC CCTCGGTGTC 250 

CGTCAGCTGA TCGTCGCCAT CAACAAGATG GACTCGACCA AGTACTCTGA 300 

45 GGCCCGTTAC AACGAAATCG TCAAGGAGGT CTCCACCTTC AT C AAGAAG A 3 50 

TCGGTTTCAA CCCCAAGTCC GTTCCCTTCG TCCCCATCTC TGGCTGGAAC 400 

GGTGACAACA TGTTGGAGGA GTCCTCCAAC ATGCCCTGGT TCAAGGGCTG 450 

GAAGAAGGAG ACCAAGGCTG GCGAGAAGTC CGGCAAGACC CTCCTTGAGG 500 

CCATTGACAA CATTGACCCC CCGGTCCGTC CCTCGGACAA GCCCCTCCGT 550 

50 CTTCCCCTCC AGGATGTCTA CAAGATCGGT GGTATCGGCA CAGTCCCCGT 600 

CGGTCGTGTC GAGACTGGTG TCATCAAGGC TGGTATGGTC GTGACCTTCG 650 

CCCCCGCCAA CGTCACCACT GAAGTCAAGT CCGTCGAGAT GCACCACGAG 700 

CAGCTCGTCG AGGGTGTCCC CGGTGACAAC GTCGGTTTCA ACGTCAAGAA 750 

CGTTTCCGTC AAGGATATCC GCCGTGGTAA CGTCTGCTCT GACTCCAAGA 800 

55 ACGACCCCGC CAAGGAGTCT GCCTCGTTCA CCGCCCAGGT CATCGTCCTG 850 

AACCACCCCG GTCAGATCGG TGCCGGTTAC GCCCCGGTTC TTGACTGCCA 900 

CACCGCTCAC ATTGCCTGCA AGTTCGCTGA GCTCCTCGAG AAGATCGACC 950 

GTCGTTCCGG YAAGAAGCTC GAAGAGTCCC CCAAGTTCGT CAAGTCGGGT 1000 

GACTCCGCCA TCGTCAAGAT GGTTCCCTCC AAGCCCATGT GCGTTGAGGC 1050 

60 CTACACTGAG TACCCCCCTC TTGGCCGTTT CGCC 1084 
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2) INFORMATION FOR SEQ ID NO: 788 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1155 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Trichophyton tonsurans 

(B) STRAIN: ATCC 56185 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 788 

20 GCTCAAGGCC GAGCGTGAGC GTGGTATCAC CATCGATATC GCCCTCTGGA 50 

AGTTCGAGAC CCCCAAGTAC AATGTCACCG TCATTGGTAT GTTTTTCTTT 100 

ACCTTTCCCC TCCATCGTCT TGCTGTGCCA TAACTAACGA GAGTAGACGC 150 

CCCCGGTCAC CGTGACTTCA TCAAGAACAT GATCACTGGT ACCTCCCAGG 200 

CTGACTGTGC TATTCTCATC ATTGCTGCCG GTACTGGTGA GTTCGAGGCT 250 

25 GGTATCTCCA AGGATGGCCA GACCCGTGAG CACGCTCTGC TCGCCTTCAC 300 

CCTTGGTGTC AAGCAGCTCA TCGTTGCCAT CAACAAGATG GACACCACCA 350 

ACTGGTCCGA GGACCGTTTC AAGGAAATCA TCAAGGAAGT CACCAACTTC 400 

ATCAAGAAGG TTGGCTACGA CCCCAAGGGT GTTCCATTCG TTCCAATCTC 450 

TGGTTTCAAC GGTGACAACA TGATTGAGGC CTCCAGCAAC TGCCCATGGT 500 

3 0 ACAAGGGATG GAACAAGGAG ACCAAGGCCG GTGGTGCCAA GACTGGCAAG 550 

ACCCTYCTCG AGGCCATCGA TGCCATCGAC ATGCCAACCC GTCCTACCGA 600 

CAAGCCCCTY CGTCTCCCAC TCCAGGATGT CTACAAGATC TCTGGTATCG 650 

GAACTGTACC AGTCGGTCGT GTTGAGACCG GTATCATCAA GCCTGGTATG , 700 

GTCGTCACCT TCGCCCCTGC CAACGTCACC ACTGAAGTCA AGTCCGTCGA 750 

35 AATGCACCAC CAGCAGCTTC AGCAGGGTGT CCCCGGTGAC AACGTCGGCT 800 

TCAACGTCAA GAACGTTTCC GTCAAGGAAG TCCGCCGTGG TAACGTTGCC 850 

GGTGACTCCA AGAACGACCC ACCATCCGGC TGTGCCTCCT TCAACGCCCA 900 

GGTCATCGTC CTCAACCACC CCGGCCAGAT CGGTGCTGGT TACGCCCCAG 950 

TCCTCGACTG CCACACTGCT CACATTGCTT GCAAGTTCGC TGAGCTCCTC 1000 

40 GAGAAGATTG ACCGCCGTAC CGGTAAATCC GTCGAAGCCA ACCCCAAGTT 1050 

CGTCAAGTCT GGTGATGCCG CTATCGCCAA GATGGTTCCC TCCAAGCCCA 1100 

TGTGCGTTGA GGCTTTCACT GACTACCCCC CACTTGGTCG TTTCGCCGTC 1150 

' CGTGA 1155 

45 

2 ) INFORMATION FOR SEQ ID NO: 789 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 1138 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: ATCC 8125 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 789 

TCAAGGCCGA GCGTGAGCGT GGTATCACCA TCGATATCGC CCTCTGGAAG 50 

TTCGAGACCC CCAAGTACAA TGTCACCGTC ATTGGTATGT TTCTCTTTAC 100 

5 CTTTCCCCTC CATCGTCTTG CTGTGCCATA ACTAACGAGA GTAGACGCCC 150 

CCGGTCACCG TGACTTCATC AAGAACATGA TCACTGGTAC CTCCCAGGCT 2 00 

GACTGTGCTA TTCTCATCAT TGCTGCCGGT ACTGGTGAGT TCGAGGCTGG 250 

TATCTCCAAG GATGGCCAGA CCCGTGAGCA CGCTCTGCTC GCCTTCACCC 300 

TTGGTGTCAA GCAGCTCATC GTTGCCATCA ACAAGATGGA CACCACCAAC 3 50 

10 TGGTCCGAGG ACCGTTTCAA GGAAATCATC AAGGAAGTCA CCAACTTCAT 4 00 

CAAGAAGGTT GGCTACGACC CCAAGGGTGT TCCATTCGTT CCAATCTCTG 450 

GTTTCAACGG TGACAACATG ATTGAGGCCT CCACCAACTG CCCATGGTAC 500 

AAGGGATGGA ACAAGGAGAC CAAGGCCGGT GGTGCCAAGA CTGGCAAGAC 550 

CCTCCTCGAG GCCATCGATG CCATCGACAT GCCAACCCGT CCTACCGACA 600 

15 AGCCCCTCCG TCTCCCACTC CAGGATGTCT ACAAGATCTC TGGTATCGGA 650 

ACTGTACCAG TCGGTCGTGT TGAGACCGGT ATCATCAAGC CCGGTATGGT 700 

CGTCACCTTC GCCCCTGCCA ACGTCACCAC TGAAGTCAAG TCCGTCGAAA 750 

TGCACCACCA GCAGCTTCAG CAGGGTGTCC CCGGTGACAA CGTCGGCTTC 800 

AACGTCAAGA ACGTTTCCGT CAAGGAAGTC CGCCGTGGTA ACGTTGCCGG 850 

20 TGACTCCAAG AACGACCCAC CATCCGGCTG TGCCTCCTTC AACGCCCAGG 900 

TCATCGTCCT CAACCACCCC GGCCAGATCG GTGCTGGTTA CGCCCCAGTC 950 

CTCGACTGCC ACACTGCTCA CATTGCTTGC AAGTTCGCTG AGCTCCTCGA 1000 

GAAGATTGAC CGCCGTACCG GTAAATCCGT CGAAGCCAAC CCCAAGTTCG 1050 

TCAAGTCTGG TGATGCCGCT ATCGCCAAGA TGGTTCCATC CAAGCC TATG 1100 

25 TGCGTTGAGG CTTTCACTGA CTACCCCCCA CTTGGTCG 1138 



2) INFORMATION FOR SEQ ID NO: 790* 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 748 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Bipolar is hawaiiensis 

(B) STRAIN: ATCC 26067 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 790 

45 CACCACCAAG TGGTCTGAGG AGCGTTACCA GGAAATCATC AAGGAGACCT 50 

CCAACTTCAT CAAGAAGGTC GGCTACAACC CCAAGCACGT TCCCTTCGTC 100 

CCCATCTCTG GTTTCAACGG AGACAACATG ATTGAGGCTT CCACCAACTG 150 

CCCCTGGTAC AAGGGTTGGG AGAAGGAGAC CAAGGCCAAG GCCACTGGTA 200 

AGACCCTTCT CGAGGCCATC GACGCCATCG ACCCCCCTGT CCGTCCTACC 250 

50 GACAAGCCCC TCCGCCTTCC CCTCCAGGAT GTGTACAAGA TTGGTGGTAT 3 00 

TGGCACGGTT CCCGTCGGTC GTGTCGAGAC CGGTATCATC AAGCCCGGTA 350 

TGGTCGTCAC CTTCGCCCCC GCTGGTGTCA CCACTGAAGT CAAGTCCGTC 400 

GAGATGCACC ACGAGCAGCT TGCCGAGGGT GTCCCCGGTG ACAACGTCGG 450 

CTTCAACGTC AAGAACGTCT CCGTCAAGGA GATCCGTCGT GGTAACGTTG 500 

55 CCGGTGACTC CAAGAACGAC CCCCCCAAGG GTTGCGAGTC CTTCAACGCC 550 

CAGGTCATCG TCCTCAACCA CCCCGGTCAG GTCGGTGCCG GTTACGCACC 600 

AGTCCTTGAC TGCCACACTG CCCACATTGC TTGCAAGTTC TCCGAGCTCC 650 

TCGAGAAGAT TGACCGCCGT ACCGGAAAGT CTGTTGAGAA CTCCCCCAAG 700 

TTCATCAAGT CCGGTGACGC CGCCATCGTC AAGATGGTTC CCTCCAAG 748 

60 
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2) INFORMATION FOR SEQ ID NO: 791 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 958 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 
15 (B) STRAIN: ATCC 14110 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 791 

CGCTATTGTC GTTGTTGCTG CCTCCGACGG TCAGATGTAG GTGGAACATC 50 

20 TTGGGAAATA CGTCGTAAAA CACGGCGCTT ACGTTTTCGC GAATAGGCCC 100 

CAGACTCGTG AGCATTTGCT GCTCGCCCGC CAGGTTGGTG TCCAGAAGAT 150 

CGTTGTCTTC GTCAACAAAA TCGATGCTAT TGATGATCCG GAGATGCTGG 200 

AACTGGTCGA ACTCGAGATG CGTGAGCTGC TGAACAGCTA CGGTTTCGAG 250 

GGTGAAGAGA CTCCGATCAT TTTCGGTTCC GCTCTCTGTG CTCTCGAAGG 3 00 

25 ACGCCGTGAC GACATCGGTA AAGACAGAAT TGAGCAGCTT ATGAACGCTG 350 

TCGACACCTG GATCCCCACT CCTCAGCGTG ACCTCGACAA ACCTTTCTTG 40 0 

ATGTCTGTCG AGGAAGTGTT CTCTATCGCC GGCCGTGGTA CCGTGGCTTC 450 

TGGTCGTGTC GAGCGTGGTA TCTTGAAGAA GGACTCTGAG GTTGAGATTG 50 0 

TTGGAGGCTC CTTCGAACCC AAGAAGACCA AAGTCACCGA CATTGAAACC 550 

30 TTCAAGAAGA GCTGTGATGA ATCGCGTGCT GGTGACAACT CTGGTCTCCT - 600 

CCTGCGTGGT ATCCGACGTG AAGACGTCAA . GCGTGGTATG GTCATTGCTG 650 

TTCCCGGCAG CACGAAAGCT CACGACAAGT. TCCTCGTCTC CATGTACGTC 700 

CTGACCGAGG CGGAGGGTGG TCGTCGTACT GGCTTCGGTG CCAACTACCG 750 

TCCCCAAGTC TTCATCCGTA CTGCAGGTAA GTTCCCGCAC ACCGTGTCCA 800 

35 GATCTTCCGA GAGATTAGCG ATATATGCTA ATGATTCATC AGACGAGGCT 850 

GCTGACCTCA GCTTCCCTGA CGGCGACCAA TCTCGCAGAG TTATGCCTGG 900 

TGACAACGTC GAGATGATCC TGAAGACC C A CCACCCTGTT GCTGCTGAGG 950 

CTGGTCAA 958 

40 

2) INFORMATION FOR SEQ ID NO: 792 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 936 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: ATCC 8125 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 792 



CGTTGTCGCA GCTTCTGACG GTCAAATGTA ATTGAATGCC CGCCCAGACG 5 0 

GATGAAAGGA TTTGACGTTT CTAACATCAT TCTAGGCCTC AGACCAGAGA 100 

60 ACATTTGCTC CTTGCCCGCC AGGTCGGTGT CCAGAAGCTG GTCGTTTTCG 150 

452 
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TTAACAAGGT CGATGCCGTT GAGGACCCAG AGATGTTGGA GCTTG^CGAA "200' 

CTTGAAATGC GTGAACTCCT CAGCCACTAC GGTTTCGAGG GTGAGGAGAC 250 

CCCCATCATT TTTGGCTCTG CTCTCTGTGC CCTCGAGTCC CGTCGACCTG 300 

AGCTTGGTGT CGAGAAGATT GACGAGCTAT TGAACGCCGT CGACACCTGG 350 

5 ATCCCCACCC CCGAGCGCGC CACTGATAAG CCTTTCCTCA TGTCCATTGA 40 0 

GGAAGTGTTC TCTATCTCTG GTCGTGGTAC CGTCGTCTCC GGTCGTGTTG 450 

AGCGTGGTAT CCTCAAGAAG GATTCCGACG TCGAAATTGT TGGTGGCTCT 500 

ACCACCCCTA TCAAGACCAA GGTCACAGAT ATCGAAACCT TCAAGAAGTC 550 

CTGCGATGAA TCTCGAGCTG GTGACAACTC TGGTCTCCTT CTCCGAGGTA 600 

10 TCAAGCGTGA GGACTTGAAG CGTGGAATGG TTGTTGCTGC CCCCGGATCC 650 

ACCAAGGCTC ACACCGACTT CATGGTCTCC CTCTACGTCC TGACTGAGGC 700 

TGAGGGTGGT CGTTCCAACG GCTTCACCCA CAAGTACCGC CCCCAAATGT 750 

TCATCCGTAC TGCTGGTATG TAACCCAAGT TTCCGCTATT TACTAAGTAG 800 

ATCATTGCTA ACTTGTATTT CCTTCCGTAG ACGAAGCCGC ATCTTTCAGC 850 

15 TGGCCTGGAG AAGACCAAGA CAAGAAGGCT ATGCCTGGTG ACAACGTCGA 900 

GATGATTTGC AAGACCCTCC ACCCCATTGC TGCCGA 936 



20 .2) INFORMATION FOR SEQ ID NO: 793 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
30 " (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 793 

TTATTGTTGC TGCTGGTACT 2 0 



35 



45 



50 



2) INFORMATION FOR SEQ ID NO: 794 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 794 

GACGACAAGT CGGTGAACTT 2 0 



55 2) INFORMATION FOR SEQ ID NO: 795 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 795 

ACTTGCACGC GATGTGGCAG 



2) INFORMATION FOR SEQ ID NO: 796 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 796 
GGTCCAATGC CWCAAACWAG A 



2) INFORMATION FOR SEQ ID NO: 797 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 797 
CATTAAGAAT GGYTTATCTG TSKCTCT 



2) INFORMATION FOR SEQ ID NO: 798 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) - TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvnm 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 798 
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5 2) INFORMATION FOR SEQ ID NO: 799 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 799 
2 0 ACCTGTGAAT ACAAGCAATC T 21 



2) INFORMATION FOR SEQ ID NO: 800 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 800 

GATGAAATCT TCAACGAAGT TGAT 24 

40 

2) INFORMATION FOR SEQ ID NO: 801 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



50 



55 



60 



(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 801 

ACAACACCGA GAAGATCCCA 2 0 
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2) INFORMATION FOR SEQ ID NO: 802 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 802 

TTGCCATTTC TGGTTTCGTT 20 



20 2) INFORMATION FOR SEQ ID NO: 803 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 803 

35 ACTTCAGTGG TAACACCAGC 20 



2) INFORMATION FOR SEQ ID NO: 804 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 804 

CCTGGGACGG CCTCTGGCAT 20 

55 

2) INFORMATION FOR SEQ ID NO: 805 

60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 805 

CTCTTGTCCA TCTTAGCAGT 



2} INFORMATION FOR SEQ ID NO: 806 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 806 

AGCATCACCA GACTTGATAA G 



2) INFORMATION FOR SEQ ID NO: 807 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 807 

AAAGTGGCTT CAAAGGTTGC 



2) INFORMATION FOR SEQ ID NO: 808 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 808 
GCITTAIWRG CATTAGAARA YCCA 24 



2) INFORMATION FOR SEQ ID NO: 809 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 809 
TCTTCCTGTW GCAACTGTTC CTCT 2 4 



2) INFORMATION FOR SEQ ID NO: 810 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 810 

AGAGMWACAG ATAARSCATT CTTA 24 



2) INFORMATION FOR SEQ ID NO: 811 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 811 
TRAARTAGAA TTGTGGTCTR TATCC 25 



2) INFORMATION FOR SEQ ID NO: 812 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 26 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 



PCT/CA00/01150 



10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 812 
GTIACIGGIT CYTYRARRTT ICCICC 



26 



15 



20 



2) INFORMATION FOR SEQ ID NO: 813 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 813 



25 AATCYGTYGA AATGCAYCAC GA 



22 



30 



35 



40 



2 ) INFORMATION FOR SEQ ID NO: 814 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 814 
G C I GGCACGT AC AC I GCCTG 



20 



4 5 2 ) INFORMATION FOR SEQ ID NO: 815 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 815 



TGGTGCATYT CKACRGACTT 



20 



60 
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2) INFORMATION FOR SEQ ID NO: 816 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 816 
GCTACGACGA GATCAAGGGC 



2) INFORMATION FOR SEQ ID NO: 817 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 817 
TGGAAGAAGG CCGAGGAGTT 



2) INFORMATION FOR SEQ ID NO: 818 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 818 
AGCCGGGCTG GATCTTCTTC 



2) INFORMATION FOR SEQ ID NO: 819 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 819 

460 
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2) INFORMATION FOR SEQ ID NO: 823 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 823 
GAGCGGTATG AYGAGATTGT 



2) INFORMATION FOR SEQ ID NO: 824 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 824 
GGCTTCTGCG GCACCATGCG 



2) INFORMATION FOR SEQ ID NO: 825 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 825 
ATGAGCARCG SAACCATCGT TCAGTG 



2) INFORMATION FOR SEQ ID NO: 82 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 826 
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TCGATCGTGC CGACCATGTA GAACGC 2 6 



2) INFORMATION FOR SEQ ID NO: 827 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 446 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium novyi 

(B) STRAIN: ATCC 19402 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 827 

CACCAACTTG CTAAATGGGG AGATGCCCAG ATTGTTGTAT ATATAGGCTG 50 

TGGAGAACGT GGAAATGAAA TGACAGATGT TCTTAATGAG TTTCCAGAAC 100 

TTAAAGATCC TAAGACTGGC AAATCAATAA TGGAAAGAAC AGTTTTAATA 150 

25 GCAAATACTT CTAATATGCC AGTTGCAGCC CGTGAAGCTT GTATATATAC 200 

AGGAATCACA ATAGCAGAAT ATTTTAGAGA TATGGGATAT TCAGTAGCAC 250 

TTATGGCGGA TTCCACTTCA CGTTGGGCAG AGGCATTAAG AGAAATGTCT 300 

GGAAGACTTG AAGAAATGCC TGGTGATGAA GGTTACCCAG CTTATTTAGG 350 

ATCAAGACTT GCTGATTTCT ATGAAAGAGC TGGAAAAGTT GTGTGTTTAG 400 

30 GAGACGATGA AAGAGAAGGT GCCATTACTG CAATAGGTGC TGTATC 446 



2 ) INFORMATION FOR SEQ ID NO : 828 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 445 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Clostridium difficile 

(B) STRAIN: 9689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 828 

50 CAGCATCAGC TTGCTAAATG GGCAGATGCA GATATAGTTG TATATATAGG 50 

CTGTGGCGAG CGTGGAAATG AAATGACAGA TGTTCTTCTT GAATTTCCTG 100 

AATTAAAAGA CCCAAGAACA GGCGAGTCAC TTATGCAAAG AACTGTGCTT 150 

ATAGCAAATA CATCAGATAT GCCGGTTGCT GCACGTGAAG CTTCTATATA 200 

CACTGGTATT ACAATAGCTG AATATTTTAG AGATATGGGA TATAGTGTTG 250 

55 CACTTATGGC AGACTCTACA TCAAGATGGG CTGAGGCTCT TAGAGAGATG 3 00 

AGTGGTCGTT TAGAGGAGAT GCCTGGTGAA GAAGGTTATC CTGCATACTT 350 

AGGTTCACGT CTTGCTCAAT TCTATGAGAG AGCAGGAAAG GTAAATTGTC 400 

TAGGTATGGA TGAAAGAGAA GGAACACTTA CAGCAATTGG TGCAG 445 



60 
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2) INFORMATION FOR SEQ ID NO: 829 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 445 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium septicum 

(B) STRAIN: ATCC 12464 

(xi) SEQUENCE DESCRIPTION: SEQ 

ATGCTATAGC TAAATGGGGA GACAGCGAAA 
GGAGAACGTG GTAACGAAAT GACAGACGTT 
TATTGACCCA AAAACTGGGG AAAGTTTAAT 
CTAATACTTC AAACATGCCA GTTGCTGCTA 
GGTATTACAA TAGCTGAATA CTTCAGAGAT 
AATGGCTGAT TCAACTTCAA GATGGGCAGA 
GTAGACTTGA AGAAATGCCA GGTGATGAAG 
TCAAGACTTG CTGATTATTA CGAAAGAGCA 
TAAAGATGGT AGAGAAGGTG CTGTAACAGC 



ID NO: 829 



TAGTTGTTTA 


CGTTGGATGT 


50 


CTTAACGAAT 


TCCCAGAACT 


100 


GAAGAGAACA 


GTAC TTATAG 


150 


GAGAAGCTTG 


CATATACACA 


200 


ATGGGATACT 


CAGTATCTAT 


250 


AGCATTAAGA 


GAAATGTCAG 


300 


GATATCCAGC 


GTACTTAGGA 


350 


GGTAAGGTTG 


TTTGTCTAGG 


400 


AATTGGAGCT 


GTATC 


445 



2) INFORMATION FOR SEQ ID NO: 830 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium botulinum 

(B) STRAIN: 20:3.1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 830 

TCAAATTGCT AAATGGGGAG ATGCAGAAAT CGTTGTTTAC . GTTGGATGC6 50 

GAGAACGTGG TAACGAAATG ACAGACGTTG TTAATGAGTT CCCAGAACTT 100 

ATTGACCCTA AGACTGGCGA AAGCTTAATG AAGAGAACAG TTCTTATAGC 150 

TAATACTTCA AACATGCCAG TTGCAGCGAG AGAAGCTTCA ATATATACAG 200 

GTATCACAAT AGCTGAATAT TTCAGAGATA TGGGATATGC AGTATCAATA 250 

ATGGCTGACT CAACTTCAAG ATGGGCTGAG GCATTAAGAG AAATGTCTGG 300 

TAGACTTGAA GAAATGCCTG GTGATGAAGG ATATCCAGCT TACCTTGGAT 350 

CAAGACTTGC TGATTACTAT GAAAGAGCTG GTAAGGTTGA ATGTTTAGGT 400 

AATGATGGAA GAATTGGTTC TATAACAGCA ATCGGTGCGG TATC 444 



2) INFORMATION FOR SEQ ID NO: 831 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 456 bases 

464 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium perfringens 

(B) STRAIN: ATCC 13124 

(xi) SEQUENCE DESCRIPTION: SEQ 

TAGTTCAGCA CCAAGTTGCT AAATGGGGAG 
GTTGGATGTG GAGAACGTGG TAACGAGATG 
CCCAGAACTT AAAGACCCTA AAACTGGGGA 
TTCTTATTGC TAATACATCT AACATGCCAG 
ATATATACTG GTATAACAAT AGCAGAGTAT 
AGTATCAATC ATGGCTGACT CAACTTCACG 
AAATGTCAGG AAGACTTGAA GAAATGCCAG 
TACTTAGGAT CAAGACTTGC TGATTACTAT 
AGCTTTAGGT AAAGATGGAA GAGAAGGAGC 
TATCCC 



2) INFORMATION FOR SEQ ID NO: 832 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium tetani 

(B) STRAIN: ATCC 19406 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 832 

CCAACTTGCA AAATGGGCTG ATGCTCAAAT AGTTGTGTAC ATAGGATGTG 50 

GAGAACGTGG AAATGAAATG ACAGACGTTT TAAATGAGTT CCCAGAATTA 100 

AAGGATCCTA AAACCGGGGA ATCTTTAATG AAAAGAACTG TGTTAATAGC 150 

AAATACATCT AATATGCCTG TTGCAGCTAG AGAAGCATCT ATATATACTG 200 

GTATAACAAT AGGGGAATAT TTTAGAGATA TGGGATATTC AATAGCACTA 250 

ATGGCAGATT CGACTTCTAG ATGGGCAGAG GCTCTAAGAG AAATGTCTGG 300 

iVAGACTAGAG GAGATGCCAG GTGAAGAAGG TTATCCAGCT TATTTAGGAT 350 

CTAGATTAGC AGAGTTCTAT GAAAGAGCAG GTAATGTTAT ATGTTTAGGT 400 

CAGGATGGAA GAGAAGGAGC ATTAACAGCT ATAGGAGCAG TTTC 444 



2) INFORMATION FOR SEQ ID NO: 833 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1786 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

465 



ID NO: 831 

ATACTGAGAT AGTTGTTTAC 50 

ACAGACGTTC TTAACGAATT 100 

AAGCTTAATG AAGAGAACAG 150 

TTGCTGCCAG AGAAGCATCA 2 00 

TTCAGAGATA TGGGATACTC 250 

TTGGGCAGAG GCTTTAAGAG 300 

GAGACGAAGG TTACCCAGCA 350 

GAAAGAGCTG GTAAAGTTGT 400 

TGTTACAGCT ATCGGAGCAG 450 
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(ii) MOLECULE TYPE: Genomic DNA * 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 
5 (C) ACCESSION NUMBER: Genome project 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 833 

TGAACCAAGG AAAAATAATA ACCGTTTCGG GACCTCTTGT TGTGGCTTCT 50 

10 GGGATGCAAG AAGCTAATAT TCAAGATATT TGTCGTGTGG GACATCTTGG 100 

CTTAGTCGGA GAAATTATTG AAATGCGTCG CGATCAAGCG TCTATTCAGG 150 

TTTATGAGGA AACATCAGGG ATCGGTCCAG GAGAACCAGT AGTGACTACT 200 

GGTTGTCCTT TGTCGGTCGA GTTAGGCCCG GGCCTGATTT CAGAAATGTT 250 

TGACGGTATT CAGCGACCGC TTGATCGTTT TCAAAAAGCA ACGGACAGCG 300 

15 ACTTTTTAAT CCGTGGTGTG GCTATCCCAA GTCTTGATCG AAAGGCTAAG 350 

TGGGCATTTA TTCCCAAGCT AAGTGTTGGT CAAGAAGTAG TTGCAGGTGA 400 

TATTTTAGGA ACTGTGCAAG AAACAGCTGT CATTGAGCAC CGTATCATGG 450 

TTCCTTATAA AGTTTCAGGG ACCTTGGTGG CTATTCATGC AGGGGACTTC 500 

ACAGTAACAG ATACAGTTTA TGAAATTAAG CAGGAAGACG GTTCCATTTA 550 

2 0 CCAAGGTAGC CTCATGCAGA CTTGGCCAGT TCGTCAAAGT CGCCCTGTTG 600 

CTCAAAAGCT TATCCCAGTC GAACCTTTGG TTACAGGTCA ACGGGTTATT 650 

GACACCTTTT TCCCTGTTAC AAAAGGTGGT GCCGCTGCCG TTCCTGGACC 700 

ATTTGGGGCA GGAAAAACAG TTGTGCAGCA TCAAATAGCT AAATTTGCCA 750 

ACGTTGATAT TGTTATTTAT GTCGGTTGTG GGGAACGCGG CAACGAGATG 800 

25 ACCGACGTTT TGAATGAGTT TCCAGAGTTA ATTGACCCAA ATACAGGCCA 850 

GTCCATTATG GAGCGCACGG TGTTAATTGC AAACACCTCT AATATGCCAG 900 

TAGCAGCGCG TGAAGCGTCG ATTTACACAG GTATTACCAT TGCCGAATAT 950 

TTCCGTGATA TGGGCTATTC TGTGGCTATC ATGGCAGACT CGACATCACG 1000 

TTGGGCAGAA GCTCTGCGCG AGATGTCAGG ACGCCTACAA GAAATGCCTG 1050 

30 GTGATGAAGG CTACCCGGCT TACTTAGGGA GTCGTATTGC CGAATATTAT 1100 

GAACGGGCTG GTCGTGTTCG GACCTTGGGA AGTCAAGAAC GTGAGGGAAC 1150 

CATTACAGCC ATCGGCGCGG TTTCTCCTCC TGGAGGGGAT ATTTCAGAGC 1200 

CTGTCACTCA AAACACCCTT: CGGATTGTCA AAGTTTTCTG GGGGCTCGAC 1250 

GCGCCTCTTG CGCAACGGCG TCACTTCCCA GCGATTAACT GGCTGACGTC 13 00 

35 TTATTCATTG TATCAAGATG ATGTAGGAAG CTATATTGAC CGTAAACAGC 1350 

AATCTAATTG GTCCAACAAG GTAACTCGTG CCATGGCTAT TTTGCAGCGT 14 00 

GAAGCCAGTC TAGAAGAAAT TGTACGCTTG GTGGGGCTTG ATTCACTGTC 1450 

TGAACAAGAT CGTTTGACCA TGGCTGTTGC CCGGCAAATT CGGGAGGATT 1500 

ATCTCCAGCA AAATGCCTTT GATTCGGTGG ATACCTTTAC TTCCTTTCCG 1550 

40 AAACAAGAGG CCATGCTAAC CAATATTTTG ACCTTTAATG AGGAAGCCAG 1600 

CAAAGCCCTT TCTTTGGGAG CTTATTTTAA TGAGATTATG GAAGGCACTG 1650 

CTCAGGTACG CGATCGCATC GCACGCAGCA AATTTATCCC AGAAGAAAAC 1700 

TTAGAGCAGA TTAAAGGGCT TACTCAGAAG GTTACCAAAG AGATTCACCA 1750 

CGTTTTAGCA AAGGGAGGAA TTTAGATGAG CGTTCT 1786 

45 



2) INFORMATION FOR SEQ ID NO: 834 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 99 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bovis 
60 (B) STRAIN: Suarez-3 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 834 

TATCTCACGT AAGTTTTTGC GCGGCGGTTA TATATCACTC CAGGCCCTGG 50 

CTAAGTACGC TAATACTGAC GTTACTGTCT ATGTGGGATG TGGAGAGCGT 100 

GGAAACGAGA TTGCGGAGGT GCTTAAGGAG TTCCCTGAGC TGAAGACCAA 150 

GGTTGATGGC AAGGAAGTGA GCATTATGAA ACGCACTTGC TTGGTGGCCA 200 

ATACTTCAAA CATGCCAGTG GCCGCCAGGG AGGCTAGTAT CTACACTGGC 250 

ATTACCCTAT GTGAATACTT CAGGGATATG GGATACAACG CCTGTGTGAT 3 00 

GGCGGATTCC ACCAGTCGTT GGGCTGAGGC TTTGCGTGAG ATATCAGGTC 350 

GTTTAGCTGA GATGCCTGCT GATTCAGGTT ATCCCGCCTA CCTTGCTTCT 4 00 

AGGCTTTCGG CGTTCTATGA GCGTGCTGGT ACAGCTGAGT GTATTGGAAC 4 50 

ACCACTTCGT GAAGGTTCAG TTACCATTGT TGGTGCTGTA TCTCCACCA 4 99 



2) INFORMATION FOR SEQ ID NO: 835 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 464 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 835 



TTTCTCAAGC TTTGAGTAAA TATAGTAACT 
GGTTGTGGAG AAAGAGGAAA TGAAATGGCA 
TGAGCTTTAT ACTATGGTTG ATGGAAAGAA 
CTTGTTTAGT AGCTAATACA TCAAATATGC 
TCCATCTACA CTGGTATTAC ACTTTCTGAA 
TAATGTTTCT ATGATGGCAG ATTCAACTTC 
GAGAAATTTC TGGTAGATTA GCTGAAATGC 
GCATATTTAG GCGCCAGACT TGCTTCATTC 
TAAATGTATG GGTTCCCCAG ATAGAGAAGG 
CAGTTTCTCC ACCT 



CTGATGTTAT TATTTACATT 50 

GAAGTTCTTA CAGAATTCCC 100 

GGAGTCAATT ATGGAAAGAA 150 

CTGTCGCTGC TAGAGAAGCT 200 

TACTTTAGAG ATATGGGATG 250 

TCGTTGGGCT GAAGCTCTTA 3 00 

CTGCAGATTC GGGTTACCCA 350 

TATGAAAGAT CAGGAAGAGT 400 

TACAGTAACA ATTGTTGGTG 450 

464 



2) INFORMATION FOR SEQ ID NO: 83 6 
3) 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 446 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania infantum 

(B) STRAIN: MOU 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 836 
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GCCCTCTCCA AGTACTCCAA 
CGAGCGCGGT AATGAGATGG 
CGACCGTGAT CGATGGCCGC 
GTGGCGAACA CCTCGAACAT 
CACCGGCATC ACCCTGGCCG 
CCATGATGGC TGACTCGACG 
TCGGGTCGTC TGGCGGAGAT 
CAGCGCTCGT CTCGCCTCCT 
TCGGCGGGCC GAAGCGCCAG 



CTCCGATTGC 
CCGAGGTGCT 
GAGGAGTCGA 
GCCAGTCGCA 
AGTACTACCG 
TCTCGCTGGG 
GCCGGCGGAT 
TCTACGAGCG 
GGCTCCGTCA 



GTCATCTATG TCiSGCTGCGG 
CATGGAGTTC CCGACCCTGA 
TCATGAAGCG CACCTGCCTC 
GCCCGTGAGG CCTCTATTTA 
TGATATGGGC AAGCATATCG 
CCGAGGCGCT TCGTGAGATT 
GGTGGCTACC CCGCCTACQT 
CGCCGGCCTC GTCACCTGCA 
CGATCGTCGG TGCCGT 



50 
100 
150 
200 
250 
300 
350 
400 
446 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 83 7 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 456 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania major 

(B) STRAIN: ATCC 50122 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 837 

TTAGTCAGGC CCTCTCCAAG TACTCCAACT CCGACTGCGT CATCTATGTC 50 

GGCTGCGGCG AGCGCGGTAA TGAGATGGCT GAGGTGCTCA TGGATTTCCC 100 

AACTCTGACG ACCGTGATCG ATGGTCGCGA GGAGTCCATC ATGAAGCGCA 150 

CCTGCCTCGT GGCAAACACT TCGAACATGC CAGTCGCAGC CCGCGAGGCC 200 

TCTATTTACA CCGGCATCAC CCTGGCCGAG TACTACCGTG ATATGGGCAA 250 

GCATATTGCC ATGATGGCCG ACTCGACATC TCGCTGGGCC GAGGCGCTTC 300 

GTGAGATTTC CGGTCGTCTG GCGGAGATGC CAGCCGATGG TGGCTACCCT 350 

GCCTACCTCA GCGCTCGTCT CGCCTCCTTC TACGAGCGCG CCGGCCTCGT 400 

CACCTGCATC GGCGGGCCGA AGCGCCAGGG CTCCGTCACG ATCGTCGGTG 450 

CTGTGT 456 



2) INFORMATION FOR SEQ ID NO: 838 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM:. Leishmania tarentolae 

(B) STRAIN: MOU-2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 838 

AGGCCCTCTC CAAGTACTCC AACTCCGACT GCGTCATCTA CGTCGGCTGC 50 
GGCGAGCGCG GTAATGAGAT GGCCGAGGTG CTCATGGAGT TCCCGACCCT 100 
GACGACTGTG ATTGATGGCC GTGAGGAGTC GATCATGAAG CGGACCTGCC 150 
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TCGTGGCCAA CACCTCCAAC ATGCCAGTCG 
TACACTGGTA TCACCCTGGC CGAATACTAC 
CGCCATGATG GCTGACTCGA CGTCTCGCTG 
TTTCGGGTCG CCTGGCGGAG ATGCCGGCTG 
CTTAGTGCTC GTCTTGCCTC CTTCTACGAG 
CATCGGTGGG CCGAAGCGCC AGGGCTCCGT 



CAGCCCGTGA 
CGTGATATGG 
GGCGGAGGCG 
ATGGTGGGTA 
CGTGCCGGCC 
CACGATCGTC 



A^CCTCTATT 
GCAAGCATAT 
CTTCGTGAGA 
CCCCGCCTAC 
TCGTCACCTG 
GGTGCCGTGT 



200 
250 
300 
350 
400 
450 



10 



15. 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 839 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 437 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma brucei 

(B) STRAIN: EATRO 795 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 839 

GCACTCTCGA AGTACTCGAA CAGTGACGCT GTTATTTACG TGGGTTGTGG ■ 50 

TGAGCGTGGC AATGAGATGG CGGAGGTGCT CATGGACTTT CCCACCCTCA 100 

CCACCATTAT TGATGGACGT GAGGAGTCTA TCATGAAGCG CACATGCCTG 150 

GTGGCAAATA CTTCCAATAT GCCTGTTGCT GCTCGTGAGG CATCTATTTA 200 

CACTGGTATC ACCTTAGCTG AGTATTATCG GGATATGGGA AAACACATCG 250 

CCATGATGGC CGATTCAACC TCCCGCTGGG CTGAGGCTTT GCGTGAGATT 3 00 

TCCGGTCGTC TTGCTGAAAT GCCTGCAGAT GGAGGTTATC CCGCGTACCT 3 50 

CAGCGCCCGT TTGGCCTCCT TCTACGAGCG- TGCTGGCCGT GTGACATGCA 400 
TCGGTGGGCC GAAGCGTGAA GGGTCTGTAA CAATTGT , 437 



2) INFORMATION FOR SEQ ID NO: 840 

* 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1052 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: MM3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84 0 

TGAAGGCTGA GCGCGAGCGC GGCATCACGA TCGACATCGC GCTCTGGAAG 50 

TTCGAGTCGC CCAAGTCTGT GTTCACGATC ATCGACGCCC CCGGCCACCG 100 

CGACTTCATC AAGAACATGA TCACGGGCAC GTCTCAGGCG GACGCCGCCG 150 

TCCTTGTCAT TGCGTCATCG CAGGGTGAGT TTGAGGCGGG CATCTCGAAG 200 

GACGGCCAGA CACGCGAGCA CGCGCTGCTC GCCTTCACGC TCGGCGTGAA 250 

GCAGATGGTT GTGTGCTGCA ACAAGATGGA CGACAAGTCG GTGAACTTCG 3 00 

CCCAGGAGCG CTACGATGAG ATTGTGAAGG AGGTGTCGGC GTACCTGAAG 350 
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AAGGTTGGGT ACAACGTGGA GAAGGTGCGC TTCATCCCCA TCTCCGGCTG A00 

GCAGGGCGAC AACATGATTG ACAAGTCGGA AAATATGCCG TGGTACAAGG 450 

GCCCCACGCT GCTGGAGGCA CTCGACATGC TGGAGCCCCC GGTGCGCCCC 500 

AGCGACAAGC CGCTGCGCCT GCCGCTGCAG GACGTGTACA AGATCGGCGG 550 

5 TATCGGCACC GTGCCGGTCG GTCGCGTGGA GACGGGCACG ATGAAGCCCG 600 

GCGACGTGGT GACGTTTGCG CCCGCCAACG TGACGACGGA GGTGAAGTCG 650 

ATTGAGATGC ACCACGAGCA GCTGGCCGAG GCCACGCCCG GCGACAACGT 700 

CGGCTTCAAC GTGAAGAACG TGTCCGTGAA GGACATCCGC CGTGGCAACG 750 

TGTGCGGCAA CTCGAAGAAC GACCCCCCAA AGGAGGCGGC CGACTTCACG 800 

10 GCGCAGGTGA TCATCCTGAA CCACCCCGGC CAGATCGGCA ACGGCTATGC 850 

GCCGGTGCTC GACTGCCACA CCTGCCACAT CGCGTGCAAG TTCGCCGAGA 900 

TCGAGTCCAA GATCGACCGC CGCTCCGGCA AGGAGCTTGA GAAGAACCCC 950 

AAGTCGATCA AGTCCGGTGA CGCCGCCATG GTGCGCATGG TGCCGCAGAA 1000 

GCCCATGTGC GTGGAGGTGT TCAACGACTA CGCTCCTCTT GGCCGCTTTG 1050 

15 CC 1052 



2) INFORMATION FOR SEQ ID NO: 841 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1061 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM:. Trypanosoma cruzi 

(B) STRAIN: CGL-1 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 841 

35 TGAAGGCTGA GCGCGAGCGC GGCATCACGA TCGACATCGC GCTCTGGAAG 50 

TTCGAGTCGC CCAAGTCTGT GTTCACGATC ATCGACGCCC CCGGCCACCG 100 

CGACTTCATC AAGAACATGA TCACGGGCAC GTCTCAGGCG GATGCCGCCG 150 

TCCTTGTCAT TGCGTCATCG CAGGGTGAGT TTGAGGCGGG CATCTCGAAG 200 

GACGGCCAGA CACGCGAGCA CGCGCTGCTC GCCTTCACGC TCGGCGTGAA 250 

40 GCAGATGGTT GTGTGCTGCA ACAAGATGGA CGACAAGTCG GTGAACTTTG 300 

CCCAGGAGCG CTACGATGAG ATTGTGAAGG AGGTGTCGGC GTACCTGAAG 350 

AAGGTTGGGT ACAACGTGGA GAAGGTGCGC TTCATCCCCA TCTCCGGCTG 4 00 

GCAGGGCGAC AACATGATTG ACAAGTCGGA AAATATGCCG TGGTACAAGG 450 

GCCCCACGCT GCTGGAGGCA CTCGACATGC TGGAGCCCCC GGTGCGCCCC 500 

4 5 AGCGACAAGC CGCTGCGCCT GCCGCTGCAG GATGTGTACA AGATCGGCGG 550 

TATCGGCACC GTGCCGGTCG GTCGCGTGGA GACGGGCACG ATGAAGCCCG 600 

GCGACGTGGT GACGTTTGCG CCCGCCAACG TGACGACGGA GGTGAAGTCG 650 

ATTGAGATGC ACCACGAGCA GCTGGCCGAG GCCACGCCCG GCGACAACGT 700 

CGGCTTCAAC GTGAAGAACG TGTCCGTGAA GGACATCCGC CGTGGCAACG 750 

50 TGTGCGGCAA CTCGAAGAAC GATCCCCCAA AGGAGGCGGC CGACTTCACG 800 

GCGCAGGTGA TCATCCTGAA CCACCCCGGC CAGATCGGCA ACGGCTATGC 850 

GCCGGTGCTT GACTGCCACA CCTGCCACAT CGCGTGCAAG TTCGCCGAGA 900 

TCGAGTCCAA GATCGACCGC CGCTCCGGCA AGGAGCTTGA GAAGAACCCC 950 

AAGTCGATCA AGTCCGGTGA CGCCGCCATG GTGCGCATGG TGCCGCAGAA 1000 

55 GCCCATGTGC GTGGAGGTGT TCAACGACTA CGCTCCTCTT GGCCGCTTTG 1050 

CCGTGCGTGA C 1061 



60 2) INFORMATION FOR SEQ ID NO: 842 
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*(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1062 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma 

(B) STRAIN: PCU-1 

(xi) SEQUENCE DESCRIPTION: SEQ 

GCTGAAGGCT GAGCGCGAGC GCGGCATCAC 
AGTTCGAGTC GCCCAAGTCT GTGTTCACGA 
CGCGACTTCA TCAAGAACAT GATCACGGGC 
CGTCCTTGTC ATTGCGTCAT CGCAGGGTGA 
AGGACGGCCA GACACGCGAG CACGCGCTGC 
AAGCAGATGG TTGTGTGCTG CAACAAGATG 
CGCCCAGGAG CGCTACGATG AGATTGTGAA 
AGAAGGTTGG GTACAACGTG GAGAAGGTGC 
TGGCAGGGCG ACAACATGAT TGACAAGTCG 
GGGCCCCACG CTGCTGGAGG CACTCGACAT 
CCAGCGACAA GCCGCTGCGC CTGCCGCTGC 
GGTATCGGCA CCGTGCCGGT CGGTCGCGTG 
CGGCGACGTG GTGACGTTTG CGCCCGCCAA 
CGATTGAGAT GCACCACGAG CAGCTGGCCG 
GTCGGCTTCA ACGTGAAGAA CGTGTCCGTG 
CGTGTGCGGC AACTCGAAGA. ACGACCCCCC 
CGGCGCAGGT GATCATCCTG AACCACCCCG 
GCGCCGGTGC TCGACTGCCA CACCTGCCAG 
GATCGAGTCC AAGATCGACC GCCGCTCCGG 
CCAAGTCGAT CAAGTCCGGT GACGCCGCCA 
AAGCCCATGT GCGTGGAGGT GTTCAACGAC 
TGCCGTGCGT GA 



cruzi 



ID NO: 842 

GATCGACATC GCGCTCTGGA 50 

TCATCGACGC CCCCGGCCAC 100 

ACGTCTCAGG CGGACGCCGC 150 

GTTTGAGGCG GGCATCTCGA 200 

TCGCCTTCAC GCTCGGCGTG 250 

GACGACAAGT CGGTGAACTT 300 

GGAGGTGTCG GCGTACCTGA 350 

GCTTCATCCC CATCTCCGGC 400 

GAAAATATGC CGTGGTACAA 450 

GCTGGAGCCC CCGGTGCGCC 500 

AGGACGTGTA CAAGATCGGC 550 

GAGACGGGCA CGATGAAGCC 600 

CGTGACGACG GAGGTGAAGT 650 

AGGCCACGCC CGGCGACAAC 700 

AAGGACATCC GCCGTGGCAA 750 

AAAGGAGGCG GCCGACTTCA 800 

GCCAGATCGG CAACGGCTAT 850 

ATCGCGTGCA AGTTCGCCGA 900 

CAAGGAGCTT GAGAAGAACC 950 

TGGTGCGCAT GGTGCCGCAG 1000 

TACGCTCCTC TTGGCCGCTT 1050 

1062 



2) INFORMATION FOR SEQ ID NO: 843 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1057 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bovis 

(B) STRAIN: Suarez-3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 843 

TACATTGACA GCTGCATTGA CCAAGGTCTG TTCGATGGGT GGTCATGGCG 50 

AGTACACTCC TTATGAAGCA ATTGACCGTG CTCCTGAGGA GCGTAAACGT 100 

GGTATTACTA TAAATTCGAC ACATGTTGAA TATGAGACTA AGAACCGTCA 150 

TTACGGTCAC GTGGACTGTC CAGGTCACTC TGATTATGTG AAGAACATGA 200 
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TATCTGGCGC TGCTCAGATG GATGGTGCCA TATTGGTTGT TTCT^GTGTT 2^0 

GACGGTCCCA TGCCTCAGAC TAAGGAGCAC GTGTTGCTTG CTAAGCAGAT 300 

TGGTGTACCT CGTTTAGTTG TGTTTTTGAA CAAGCTTGAC ATGTTAGAGG 350 

ACTCTGAGCT ATTGGAGTTG GTGGAGTTAG AGGTTCGTGA GTTATTGAGT 400 

5 GAGTTTGGTT ACGACGGTGA CAACACGCCT ATCGTTCGTG GCAGTGCTAT 450 

AAAGGCATTG AACAGTAGTT CCGAGGCTGA CATTAAGCCA ATTCAGGATT 500 

TATTGGATGC GTGTGATGCC TTTTTACTGA CTCCAGAACG TAAGGATGAC 550 

ATGCCGCTCT TGGTTGCTAT TGACGATGTT CTTGCCATTC CTGGCAAGGG 600 

TACTGTTGTA ACCGGTAGGA TAGAGCAGGG CAAGATTCGT TGTGGTGACC 650 

10 CTATTGAGGT TTGCGCCGGT CCGAAGTCCG GCAAGAAGAC TGTGTGTGTT 700 

GGTCTTRARA TGTTCCGCAA GAGTCTCAGT RAGGGTATTG CTGGTGACCA 750 

GATTGGTGTT TTGCTCAAGG GTGTGAAGCG CGACRAGGTA GAGCGCGGGT 800 

TTGTATTGAT TCAACCCGGA AGTTACAAAT GTCACGGTGA ATTTGATGCT 850 

GACTTGTACG TGTTGACTAC GRAGGAAGGT GGGCGCAAGC ATCCGTTTGT 900 

15 GTCTAACTAC CGTCCTCAGG CGTTTATACG TACTGGAGAC GTTTGCTGCT 950 

CAGTTCATTT GGATRAGGGT GTTGAGATGG CAGCTCCTGG TGACAACGTG 100 0 

CGTTGCAAGA TCAAGTTACT TTATCCCATG CCTGTCCATG AAGGTTTACG 1050 

ATTTGCG 1057 

20 

2) INFORMATION FOR SEQ ID NO: 844 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 943 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

3 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania aethiopica 

(B) STRAIN: ATCC 50119 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 844 



CGGTGGCATC ATTGTGGTGG CGGCCACCGA CGGCGTCATG CCGCAGACAC 50 

GCGAGCACCT CCTGATCTGC TCGCAGATTG GGCTTCCGGC GCTCGTAGGG 100 

40 TTCATCAACA AAGTGGATAT GACGGACGAG GACACGTGCG ACCTGGTGGA 150 

CATGGAGGTG CGCGAGCAGC TGGAGAAATA CAAGTTTCCG GCGGAGGAGA 2 00 

CACCAATCGT GCGCGGCTCG GCCCTCAAGG CCGTCGAGGG CGACGCGAAG 250 

TACGAGGAGA ACATCCTCGA ACTGGTGCGG AAGTGCGACG AGTGGATCCC 3 00 

CGACCCGCCG CGCAACACAG ACAAGCCTTT CCTTATGGCC ATCGAGCACG 350 

45 TTTACGAGAT CGGCAAGGAC AAGAAGAGCG TCATCGTGAC CGGCCGCGTC 400 

GATCAGGGCG TGCTGAAGCT CAACACAGAC GCCGAGCTGG CCGGCTTCAG 450 

CGCCAAGAAG TCGACGGTGA GGGTGACGGG CATCGAGATG TACCACAAGA 500 

CGCTGAGCGA GTGCATGCCC GGTGACTCCG TCGGCGTCAG CATTGTCGGC 550 

ACCGGCGACA CAACCAGTCT GTCCAAGGAC AACGTGGAAC GCGGCATGGT 600 

50 AATGGCGGCG ACGGGTAGCA CGAACCTGTA CAACAAGGTG AAGGCGCAGG 650 

TGTACGTGCT GACGAAGGAT GAGGGCGGCC GCCACACCGG CTTCAGCCCC 700 

CACTACCGCC CGCAGCTCTT CTTCCATTGC GCTGACGTGA CAGCGGACAT 750 

GAGCTTCCCG GAGGCGGAGA AGCACCGGGA GGAGCTGAAC AAGAAATTCG 800 

GCCGCGGCCC CGAGGAGGAC AAGAAGAAAG AGGCGGAGAT GAAGGAGTTC 850 

55 GAGAGCAAGC TCGTCTGCAT GCCGGGCGAT AACCGCGAGC TGATCCTGAC 900 

GCTCGCGTAC CCGATGCCCA TTGAAAAGGG CCTGAAGTTC ACC 943 



60 2) INFORMATION FOR SEQ ID NO: 845 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 939 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania amazonensis 

(B) STRAIN: ATCC 50131 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 845 



15 



CGGTGGCATC ATTGTGGTGG CGGCCACCGA CGGCGTCATG CCGCAGACAC 50 

GCGAGCATCT CTTGATCTGC TCGCAGATTG GGCTTCCGGC GCTCGTAGGG 100 

TTCATCAACA AAGTGGACAT GACGGACGAG GACACGTGCG ACCTGGTGGA 150 

CATGGAGGTG CGTGAGCAGC TGGAGAAATA CAAGTTTCCG GCGGAAGAGA 200 

20 CGCCCATCGT GCGCGGCTCG GCCCTCAAAG CCGTCGAGGG CGACGCGAAG 250 

TACGAGGAGA ACATCCTCGA ACTGGTGCGG AAGTGCGACG AATGGATCCC 300 

CGACCCGCCG CGCAACACAG ACAAGCCTTT CCTTATGGCC ATTGAGCACG 350 

TTTACGAGAT CGGCAAGGAC AAGAAGAGCG TCATCGTGAC CGGCCGCGTC 400 

GATCAGGGCG TGCTGAAGCT CAACACAGAC GCCGAGCTGG CCGGCTTCAG 450 

25 CGC CAAGAAG TCGACGGTGA GGGTGACGGG CATCGAGATG TACCACAAGA 500 

CGCTGAGTGA GTGCATGCCC GGTGACTCCG TCGGCGTCAG CATTGTCGGC 550 

ACCGGCGACA CGATCAGTCT CTCCAAGGAC AACGTTGAAC GCGGCATGGT 600 

AATGGCGGCA ACGGGTAGCA CGAACCTGTA CAACAAGGTG AAGGCGCAGG 650 

TGTACGTGCT GACGAAGGAT GAGGGCGGCC GCCACACTGG CTTCAGCCCC 700 

30 CACTACCGCC CGCAGCTCTT CTTCCATTGT GCTGACGTGA CGGCGGACAT 750 

GAGCTTCCCG 'GAGGCGGAGA AGCACCGCGA GGAGCTCAAC AAGAAATTCG 80.0 

GCCGCGGCCC CGAGGAGGAC AAGCAGAAGG AGGCGGAGAT GAAAGAGTTC 850 

GAGAGCAAGC : TCGTCTGCAT GCCGGGCGAC AACCGCGAGC TGATCCTGAC 900 

GCTGGCGTAC CCGATGCCCA TTGACAAGGG TCTGAAGTT 939 

35 



2) INFORMATION FOR SEQ ID NO: 846 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 945 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania donovani 
50 (B) STRAIN: ATCC 50212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 846 

CATTGTGGTG GCGGCCACCG ACGGCGTCAT GCCGCAGACA CGCGAGCACC 50 

55 TCCTGATCTG CTCGCAGATC GGGCTTCCGG CGCTCGTAGG GTTCATCAAC 100 

AAGGTGGACA TGACGGACGA GGACACGTGC GACCTGGTGG ACATGGAGCT 150 

GCGCGAGCAG CTGGAGAAAT ACAAGTTTCC GGCGGAGGAG ACGCCAATCG 2 00 

TGCGCGGCTC AGCCCTCAAA GCCGTCGAGG GCGATGCGAA GTACGAGGAG 250 

AACATTCTCG AACTGGTGCG GAAGTGTGAC GAGTGGATCC CTGACCCGCC 300 

60 GCGCAACACA GACAAGCCTT TCCTTATGGC CATCGAGCAC GTTTACGAGA 350 

473 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

TCGGCAAGGA CAAGAAGAGC 
ATTCTGAAGC TCAACACAGA 
GTCGACGGTG AGGGTGACGG 
AGTGCATGCC TGGTGACTCC 
5 ACGACCAGTC TATCCAAGGG 
GACGGGTAGC ACGAACCTGT 
TGACGAAGGA TGAGGGCGGC 
CCGCAGCTCT TCTTCCATTG 
GGAGGCGGAG AAGCACCGCG 
10 CCGAGGAGGA CAAGAAGAAA 
CTCGTCTGCA TGCCGGGCGA 
CCCGATGCCC ATTGAAAAGG 



GTTGTCGTGA CCGGCCGCGT 
CGCCGAGCTG GCCGGCTTCA 
GCATCGAGAT GTACCACAAG 
GTCGGCGTCA GCATTGTCGG 
CAACGTGGAA CGCGGCATGG 
ACAACAAGGT GAAGGCGCAG 
CGCCACACTG GCTTTAGTCC 
TGCTGACGTG ACGGCGGACA 
AAGAGCTCAA CAAGAAATTC 
GAGGCAGCGA TGAAGGAGTT 
TAACCGCGAG CTGATCCTGA 
GTCTGAAGTT CACCATCCGT 



PCT/CA00/01150 

CGATCKGGGC 40j^ 

GCTCCAAGAA 450 

ACGCTGAGCG 500 

GACCGGCGAC 550 

TGATGGCGGC 600 

GTGTACGTGC 650 

TCACTACCGC 700 

TGAGCTTCCC 750 

GGCCGCGGCC 800 

CGAGAGCAAG 850 

CGCTGGCGTA 900 

GAGGG 945 



15 



2) INFORMATION FOR SEQ ID NO: 847 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 939 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania infantum 

(B) STRAIN: MOU 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 847 

GGCATCATTG TGGTGGCGGC CACCGACGGC GTCATGCCGC AGACACGCGA 50 

GCACCTCCTG ATCTGCTCGC AGATCGGGCT TCCGGCGCTC GTAGGGTTCA 100 

TCAACAAGGT GGACATGACG GACGAGGACA CGTGCGACCT GGTGGACATG 150 

35 GAGCTGCGCG AGCAGCTGGA GAAATACAAG TTTCCGGCGG AGGAGACGCC 200 

AATCGTGCGC GGCTCAGCCC TCAAAGCCGT CGAGGGCGAT GCGAAGTACG 250 

AGGAGAACAT TCTCGAACTG GTGCGGAAGT GTGACGAGTG GATCCCTGAC 300 

CCGCCGCGCA ACACAGACAA GCCTTTCCTT ATGGCCATCG AGCACGTTTA 350 

CGAGATCGGC AAGGACAAGA AGAGCGTTGT CGTGACCGGC CGCGTCGATC 400 

40 AGGGCGTTCT GAAGCTCAAC ACAGACGCCG AGCTGGCCGG CTTCAGCTCC 450 

AAGAAGTCGA CGGTGAGGGT GACGGGCATC GAGATGTACC ACAAGACGCT 500 

GAGCGAGTGC ATGCCTGGTG ACTCCGTCGG CGTCAGCATT GTCGGCACCG 550 

GCGACACGAC CAGTCTATCC AAGGGCAACG TGGAACGCGG CATGGTGATG 600 

GCGGCGACGG GTAGCACGAA CCTGTACAAC AAGGTGAAGG CGCAGGTGTA 650 

45 CGTGCTGACG AAGGATGAGG GCGGCCGCCA CACTGGCTTT AGTCCTCACT 700 

ACCGCCCGCA GCTCTTCTTC CATTGTGCTG ACGTGACGGC GGACATGAGC 750 

TTCCCGGAGG CGGAGAAGCA CCGCGAAGAG CTCAACAAGA AATTCGGCCG 800 

CGGCCCCGAG GAGGACAAGA AGAAAGAGGC AGCGATGAAG GAGTTCGAGA 850 

GCAAGCTCGT CTGCATGCCG GGCGATAACC GCGAGCTGAT CCTGACGCTG 900 

50 GCGTACCCGA TGCCCATTGA AAAGGGTCTG AAGTTCACC 93 9 



2) INFORMATION FOR SEQ ID NO: 84 8 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 933 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 

474 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Leishmania enriettii 

(B) STRAIN: ATCC 50120 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 848 

10 CACCGACGGC GTCATGCCGC AGACACGGGA GCACCTGCTC ATCTGCTCGC 50 

AGATCGGGCT GCCGGCGCTT GTAGGGTTCA TCAATAAAGT TGACATGACG 100 

GACGAGGATA CGTGCGACCT CGTGGACATG GAGGTGCGGG AACAGCTGGA 150 

GAAGTACAAG TTTCCGGCCG AGGAGACGCC CATCGTACGT GGCTCGGCCC 2 00 

TCAAGGCCCT CGAGGGGGAT GCGCAATACG AGGGGAGTAT TCTCGAGCTG 250 

15 GTGCGAAAGT GCGACGAGTG GATCCCCGAC CCGCCGCGCA ACACCGAAAA 300 

GCCTTTCCTC ATGGCTATCG AGCACGTTTA CGAGCTCGGC AAAGACAAGA 350 

AGAGCGTCAT CGTTACCGGC CGCGTCGATC AAGGTGTGCT GAAGCTCAAC 400 

ACAGACGCCG AGCTGGCCGG CTTCAGCGCC AAGAAGGCGA CAGTCAAAGT 450 

GACGGGCATC GAGATGTATC ACAAGACACT CAATGAGTGC ATGCCCGGCG 500 

20 ACTCTGTCGG TGTCAGCATC GTCGGTACCG GTGACACGAC CAGCTTATCC 550 

AAGGATAATG TTGAGCGCGG TATGGTAATG GCGGCAACGG GTAGCACGAA 600 

CCTGTACAAC AAGCTGAAGG CGCAGGTTTA CGTGCTGACA AAGGAGGAGG 650 

GTGGCCGCCA CACCGGGTTC AGCCCCCACT ACCGCCCGCA GCTCTTCTTC 700 

CACTGCGCTG ACGTGACCGC AGACATGAGC TTCCCGGAGG CGGAGAAGTA 750 

25 CCGCGAGGAG CTCAACAAGA AGTTCGGCCG TGGCCCTGAG GAGGACAAGA 800 

AGAAAGAGGC GGAGATGAAG GAGTTCGAAA GCAAACTTGT CTGCATGCCA 850 

GGCGATAACC GCGAGCTGAT CCTAACTCTG GCGTACCCGA TGCCCATCGA 900 

CAAGGGCCTG AAGTTCACCA TCCGTGAGGG CGG 933 

30 

2) INFORMATION FOR SEQ ID NO: 849 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 943 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania gerbilli 

(B) STRAIN: ATCC 50121 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 849 



CGGTGGCATC ATTGTGGTGG CGGCCACCGA CGGCGTCATG CCGCAGACAC 50 

GCGAGCACCT CCTGATCTGC TCGCAGATTG GGCTTCCGGC GCTCGTAGGG 100 

50 TTCATCAACA AAGTGGACAT GACGGACGAG GACACGTGCG ACCTGGTGGA 150 

CATGGAGGTG CGCGAGCAGC TGGAGAAATA CAAGTTTCCG GCGGAGGAGA 200 

CACCAATCGT GCGCGGCTCG GCCCTCAAGG CCGTCGAGGG CGACGCGAAG 250 

TACGAGGAGA ACATCCTCGA ACTGGTGCGG AAGTGCGACG AGTGGATCCC 300 

CGACCCGCCG CGCAACACAG ACAAGCCTTT CCTTATGGCC ATCGAGCACG 350 

55 TTTACGAGAT CGGCAAGGAC AAGAAGAGCG TCATCGTGAC CGGCCGCGTC 400 

GATCAGGGCG TGCTGAAGCT CAACACGGAC GCCGAGCTGG CCGGCTTCAG 450 

CGCCAAGAAG TCGACGGTGA GGGTGACGGG CATTGAGATG TACCACAAGA 500 

CGCTGAGCGA GTGCATGCCC GGTGACTCCG TCGGCGTCAG CATTGTCGGC 550 

ACCGGCGACA CGACCAGTCT GTCCAAGGAC AACGTGGAAC GCGGCATGGT 600 

60 AATGGCGGCG ACGGGTAGCA CGAACCTGTA CAACAAGGTG AAGGCGCAGG 650 

475 
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TGTACGTGCT GACGAAGGAT GAGGGCGGCC 
CACTACCGCC CGCAGCTCTT CTTCCATTGC 
GAGCTTCCCG GAGGCGGAGA AGCACCGCGA 
GCCGCGGCCC CGAGGAGGAC AAGAAGAAAG 
5 GAGAGCAAGC TCGTCTGCAT GCCGGGCGAT 
GCTGGCGTAC CCGATGCCCA TTGAAAAGGG 



PCT/CA00/01150 

GCCACACTGG CTTCA&CCCC 1<S(/ 

GCTGACGTGA CAGCGGACAT 750 

GGAGCTCAAC AAGAAATTCG 800 

AGGCGGAGAT GAAGGAGTTC 850 

AACCGCGAGC TGATCCTGAC 900 

TCTGAAGTTC ACC 943 



10 2) INFORMATION FOR SEQ ID NO: 850 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 918 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania major 

(B) STRAIN: ATCC 50122 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 850 



25 



GGCATCATTG TGGTGGCGGC CACCGACGGC GTCATGCCGC AGACACGCGA 50 

ACACCTCCTG ATCTGCTCGC AAATTGGCCT TCCGGCGCTC GTAGGGTTCA 100 

TCAACAAAGT GGACATGACG GACGAGGACA CGTGTGACCT GGTGGACATG 150 

GAGGTGCGCG AGCAGCTGGA GAAATACAAG TTTCCGGCGG AGGAGACACC 2 00 

30 AATCGTGCGC GGCTCGGCCC TCAAGGCCGT CGAGGGCGAC GCGAAGTACG 250 

AGGAGAACAT CCTCGAACTG GTGCGGAAGT GCGACGAGTG GATCCCCGAC 3 00 

CCGCCGCGCA ACACAGACAA GCCTTTCCTT ATGGCCATCG AGCACGTTTA 350 

CGAGATCGGC AAGGACAAGA AGAGCGTCAT CGTGACCGGC CGCGTCGATC 400 

AGGGCGTGCT GAAGCTCAAC ACAGACGCCG AGCTGGCCGG CTTCAGCGCC 450 

35 AAGAAGTCGA CGGTGAGGGT GACGGGCATT GAAATGTACC ACAAGACGCT 500 

GAGCGAGTGC ATGCCCGGTG ACTCCGTCGG CGTCAGCATT GTCGGCACCG 550 

GCGACACGAC CAGTCTGTCC AAGGACAACG TGGAGCGCGG CATGGTAATG 600 

GCGGCGACGG GTAGCACGAA CCTGTACAAC AAGGTGAAGG CGCAGGTGTA 650 

CGTGCTGACG AAGGATGAGG GCGGCCGCCA CACTGGCTTC AGCCCCCACT 700 

40 ACCGCCCGCA GCTCTTCTTC CATTGCGCTG ACGTGACAGC GGACATGAGC 750 

TTCCCGGAGG CGGAGAAGCA CCGCGAGGAG CTCAACAAGA AATTCGGCCG 800 

CGGCCCCGAG GAGGACAAGA AGAAAGAGGC GGAGATGAAG GAGTTCGAGA 850 

GCAAGCTCGT CTGCATGCCG GGCGATAACC GCGAGCTGAT CCTGACGCTG 90 0 

GCGTACCCGA TGCCCATT 918 

45 



2) INFORMATION FOR SEQ ID NO: 851 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania mexicana 
60 (B) STRAIN: ATCC 50156 

476 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 851 

CGGTGGCATC ATTGTGGTGG CGGCCACCGA CGGCGTCATG CCGCAGACAC 50 

5 GCGAGCATCT CCTGATCTGC TCGCAGATTG GGCTTCCGGC GCTCGTAGGG 100 

TTCATCAACA AAGTGGACAT GACGGACGAG GACACGTGCG ACCTGGTGGA 150 

CATGGAGGTG CGTGAGCAGC TGGAGAAATA CAAGTTTCCG GCGGAAGAGA 200 

CGCCCATCGT GCGCGGCTCG GCCCTCAAGG CCGTCGAGGG CGACGCGAAG 250 

TACGAGGAGA ACATCCTCGA ACTGGTGCGG AAGTGCGACG AATGGATCCC 300 

10 CGACCCGCCG CGCAACACAG ACAAGCCTTT CCTTATGGCC ATTGAGCACG 350 

TTTACGAGAT CGGCAAGGAC AAGAAGAGCG TCATCGTGAC CGGCCGCGTC 400 

GATCAGGGCG TGCTGAAGCT CAACACAGAC GCCGAGCTGG CCGGCTTCAG 450 

CGTCAAGAAG TCGACGGTGA GGGTGACGGG CATCGAGATG TACCACAAGA 500 

CGCTGAGTGA GTGCATGCCC GGTGACTCCG TCGGCGTCAG CATTGTCGGC 550 

15 ACCGGCGACA CGATCAGTCT CTCCAAGGAC AACGTTGAAC GCGGCATGGT 600 

AATGGCGGCA ACGGGTAGCA CGAACCTGTA CAACAAGGTG AAGGCGCAGG 650 

TGTACGTGCT GACGAAGGAT GAGGGCGGCC GCCACACTGG CTTCAGCCCC 700 

CACTACCGCC CGCAGCTCTT CTTCCATTGT GCTGACGTGA CGGCGGACAT 750 

GAGCTTCCCG GAGGCGGAGA AGCACCGCGA GGAGCTCAAC AAGAAATTCG 800 

20 GCCGCGGCCC CGAGGAGGAC AAGCAGAAGG AGGCGGAGAT GAAAGAGTTC 850 

GAGAGCAAGC TCGTCTGCAT GCCGGGCGAC AACCGCGAGC TGATCCTGAC 900 

GCTGGCGTAC CCGATGCCCA TTGAGAAGGG TCTGAAGTT 93 9 



25 



2) INFORMATION FOR SEQ ID NO: 852 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 912 bases 
30 * (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tarentolae 

(B) STRAIN: MOU-2 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 852 

TCATTGTGGT GGCCGCCACC GACGGCGTCA TGCCGCAAAC ACGGGAGCAC 50 

CTTTTGATCT GCTCGCAGAT CGGGCTGCCG GCGCTCGTAG GGTTCATCAA 100 

CAAAGTGGAC ATGACAGACG AAGACACGTG CGACCTGGTA GACCTGGAGG 150 

45 TGCGTGAGCA GCTGGAGAAG TACAAGTTTC CGGCAGAGGA AACACCAATC 200 

GTGCGTGGCT CGGCCCTCAA GGCCGTTGAG GGCGATGCAA AGTACGAGGA 250 

GAACATCCTC GAACTGGTGC GGAAGTGCGA CGAGTGGATC CCAGACCCGC 300 

CACGCAATAC GGACAAGCCT TTCCTTATGG CCATTGAACA CGTGTACGAG 350 

ATCGGCAAGG ATAGGAAAAG CGTCATCGTA ACCGGCCGCG TCGATCAAGG 400 

50 TGTGCTGAAG CTGAACACAG ACGCCGAGCT GGCCGGCTTC AGCGCCAAGA 450 

AGTCGACGGT GAAAGTGACG GGCATTGAGA TGTACCACAA GACACTGACA 500 

GAGTGCATGC CCGGCGACTC TGTCGGCGTC AGCATTGTGG GCACTGGYGA 550 

CACGACCAGC CTCTCTAAGG ACAATGTTGA GCGTGGCATG GTACTGGCCG 600 

CTACGGGTAG CACGAACCTG TACAACAAAG TAAAGGCGCA GGTGTATGTA 650 

55 CTCACGAAGG ATGAGGGCGG CCGCCACACC GGCTTCAGCC CCCACTACCG 700 

TCCGCAGCTC TTCTTCCACT GCGCTGACGT AACGGCGGAC ATGAGCTTCC 750 

CGGAGGCGGA GAAGCACCGC GAGGAACTCA ATAAGAAATT CGGCCGCGGC 800 

CCCGAGGAGG ACAAGAAAAA GGAGGCGGAG ATGAAGGAGT TCGAGAGCAA 850 

GCTGGTCTGC ATGCCAGGCG ATAACCGCGA GCTGATCCTG ACATTGGCGT 900 

60 ACCCGATGCC TA 912 

477 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
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2 ) INFORMATION FOR SEQ ID NO: 853 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 936 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Txypanosoma cruzi 

(B) STRAIN: MM3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 853 

ATTCTTGTGG TGGCAGCTAA CGACGGATGC ATGCCGCAGA CGCGTGAGCA 50 

CCTGCTTATT TGTTCGCAGA TTGGCCTTCC TGCTCTTGTA TGCTTTATCA 100 

ATAAGTGTGA CATGATGCAA GGGCAGGAGG AAATGATTGA ACTTGTTGAA 150 

ATGGAGGTAC GTGAACTTTT GGAGAAGTAC AAGTTCCCTG CGGAGGAGAC 200 

GCCATTTGTG CGGGGGTCTG CGGTGAAGGC ATTGGAGGGT GATGCTGAAA 250 

ATGAAGGAAA GATTTTGGAG CTTGTAAAAA AATGTGATGA ATGGATTCCC 300 

GACCCACCGC GTGCCATTGA AAAACCGTTC CTTATGGCCA TTGAGCACGT 350 

TTTTGAGGTT GGAAAGGATA AGAAGGCCGT TGTTGTGAGC GGGCGTGTGG 400 

ACCAGGGGCA GTTGAAGGTC GGCGCAGATG CAGAACTTTC CGGGTTTAGC 450 

GCAAAGAAGC TGACGGTGAA GGTTGCTAGC ATCGAAATGT AC C ATAAAAT 500 

TCTGGAGGAT TGCATGCCTG GTGACTCTGT TGGCGCGAAG ATCGTTGGCA 550 

GCGGTGAAAC AGTGAACCTG TCGAAGGAAA ATGTGGAACG CGGCATGGTA 600 

CTCTCCGCAC CAGGTGCAAC GACACTGTTC AACAAGGTCC GCGCGCAGGT 650 

GTACGTGTTG ACAAAGGAAG AAGGCGGTCG TCACACAGCC TTTAGTCCTC 700 

ACTATCGTCC GCAGCTTTTC TTCCACTGTG CTGATGTCAC GGCAGATATT 750 

AACTTCCCGG AAAGCGAGAA GCTTGCAGGG GAGCTGAACA AAAAGTATGG 800 

CCGTGATGCG GCGGAACAGA AGAAGAAGGA GGCAGAACTG AAAGAGTTTG 850 

AAAAGACGCT TGTCTGCATG CCTGGTGATA ACCGCGAACT CCTGCTCACC 900 

CTTGCCTATC CAATGCCAAT GGAAAAGGGA CTCAAG 936 



2) INFORMATION FOR SEQ ID NO: 854 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 934 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: PCU-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 854 

CGGCATTCTT GTGGTGGCAG CTAACGACGG ATGCATGCCG CAGACGCGTG 50 
AGCACCTGCT TATTTGTTCG CAGATTGGCC TTCCTGCTCT TGTATGCTTT 100 
ATCAATAAGT GTGACATGAT GCAAGGGCAG GAGGAAATGA TTGAGCTTGT 150 

478 
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TGAAATGGAG GTACGTGAAC TTTTGGAGAA GTACAAGTTC CCTGCGG^GG 200 

AGACGCCATT TGTGCGGGGG TCTGCGGTGA AGGCATTGGA GGGTGATGCT 250 

GAAAATGAAG GAAAGATTTT GGAGCTTGTA AAAAAATGTG ATGAATGGAT 300 

TCCCGACCCA CCGCGTGCCA TTGAAAAACC GTTCCTTATG GCCATTGAGC 350 

5 ACGTTTTTGA GGTTGGAAAG GATAAGAAGG CCGTTGTTGT GAGCGGGCGT 400 

GTGGACCAGG GGCAGTTGAA GGTCGGCGCA GATGCAGAAC TTTCCGGGTT 450 

TAGTGCAAAG AAGCTGACGG TGAAGGTTGC TAGCATCGAA ATGTACCATA 500 

AAATTCTGGA GGATTGCATG CCTGGTGACT CTATTGGCGC GAAGATCGTT 550 

GGCAGCGGTG AAACAGTGAA CCTGTCGAAG GAAAATGTGG AACGCGGCAT 600 

10 GGTACTCTCC GCACCAGGTG CAACGACACT GTTCAACAGG GTCCGCGCGC 650 

AGGTGTACGT GTTGACAAAG GAAGAAGGCG GTCGTCACAC AGCCTTTAGT 700 

CCTCACTATC GTCCGCAGCT TTTCTTCCAC TGTGCTGATG TCACGGCGGA 750 

TATTAACTTC CCGGAAAGCG AGAAGCTTGC AGGGGAGCTG AACAAAAAGT 800 

ATGGCCGTGA CGCGGCGGAA CAGAAGAAGA AGGAGGCAGA ACTGAAAGAG 850 

15 TTTGAAAAGA CGCTTGTCTG CATGCCTGGT GATAACCGCG AACTCCTGCT 900 

CACCCTTGCC TATCCAATGC CAATGGAAAA GGGA 934 



20 2) INFORMATION FOR SEQ ID NO: 855 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 937 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: CGL-1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 855 



35 



TGGCGGCATT CTTGTGGTGG CAGCTAACGA CGGATGCATG CCGCAGACGC 50 

GTGAGCACCT GCTTATTTGT TCGCAGATTG GCCTTCCTGC TCTTGTATGC 100 

TTTATCAATA AGTGTGACAT GATGCAAGGG CAGGAGGAAA TGATTGAACT 150 

TGTTGAAATG GAGGTACGTG AACTTTTGGA GAAGTACAAG TTCCCTGCGG 2 00 

40 AGGAGACGCC ATTTGTGCGG GGGTCTGCGG TGAAGGCATT GGAGGGTGAT 250 

GCTGAAAATG AAGGAAAGAT TTTGGAGCTT GTAAAAAAAT GTGATGAATG 3 00 

GATTCCCGAC CCACCGCGTG CCATTGAAAA ACCGTTCCTT ATGGCCATTG 350 

AGCACGTTTT TGAGGTTGGA AAGGATAAGA AGGCCGTTGT TGTGAGCGGG 40 0 

CGTGTGGACC AGGGGCAGTT GAAGGTCGGC GCAGATGCAG AACTTTCCGG 450 

45 GTTTAGCGCA AAGAAGCTGA CGGTGAAGGT TGCTAGCATC GAAATGTACC 500 

ATAAAATTCT GGAGGATTGC ATGCCTGGTG ACTCTGTTGG CGCGAAGATC 550 

GTTGGCAGCG GTGAAACAGT GAACCTGTCG AAGGAAAATG TGGAACGCGG 600 

CATGGTACTC TCCGCACCAG GTGCAACGAC ACTGTTCAAC AAGGTCCGCG 650 

CGCAGGTGTA CGTGTTGACA AAGGAAGAAG GCGGTCGTCA CACAGCCTTT 700 

50 AGTCCTCACT ATCGTCCGCA GCTTTTCTTC CACTGTGCTG ATGTCACGGC 750 

AGATATTAAC TTCCCGGAAA GCGAGAAGCT TGCAGGGGAG CTGAACAAAA 800 

AGTATGGCCG TGATGCGGCG GAACAGAAGA AGAAGGAGGC AGAACTGAAA 850 

GAGTTTGAAA AGACGCTTGT CTGCATGCCT GGTGATAACC GCGAACTCCT 900 

GCTCACCCTT GCCTATCCAA TGCCAATGGA AAAGGGA 937 

55 

2) INFORMATION FOR SEQ ID NO: 856 

60 (i) SEQUENCE CHARACTERISTICS: 

479 
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(A) LENGTH: 888 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bigemina 
10 (B) STRAIN: Suarez-2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 856 

CTTATTATGG AGCTGATCAA CAACGTCGCG AAGAAGCACG GTGGTTTCTC 50 

15 CGTGTTCGCC GGCGTCGGCG AGCGCACCAG GGAGGGCAAC GAGCTGTACC 100 

ACGAGATGAT GGAGACCGGC GTCATCAAGC GCCGCCAGCT GGATGACGGC 150 

ACGTTCGACT TCTCCGGCTC CAAGGCCGCG CTGGTGTACG GCCAGATGAA 200 

CGAGCCGCCA GGTGCCAGGG CGCGTGTTGC CCTCACTGGC CTGACGGTGG 250 

CCGAGTACTT CCGTGATGAG GACGGCCAGG ACGTGCTGCT CTTCATCGAC 300 

20 AACATCTACC GTTTCACCCA GGCTGGTTCT GAGGTGAGTG CCCTTTTGGG 350 

GCGCATCCCG TCCGCCGTCG GTTACCAGCC GACCCTCGCC - ACCGACCTTG 400 

GCGCGCTGCA GGAGCGTATC ACGACGACCA ACAAGGGCTC CATCACCTCC 450 

GTGCAGGCCG TCTACGTGCC GGCCGACGAT ATCACCGACC CGGCGCCTGC 500 

GACCACCTTC ACCCATCTGG ACGCGACCAC TGTGCTCTCC CGTTCCATCG 550 

25 CCGAGCTGGG TATCTACCCC GCCGTCGACC CGCTCGACTC CACCTCGCGT 600 

ATGCTGTCCG CGAACATCGT CGGCGAGGAG CAGTACAACG TGGCGCGTGG 650 

CGTGCAGAAA ATACTGCAGG ACTACAAATC GCTGCAGGAT ATCATCGCCA 700 

TCCTGGGTAT GGACGAGCTG TCTGAGCAGG ACAAGTTCGT CGTCGCGCGT 750 

GCGCGCAAGG TTCAGCGTTT CCTATCCCAG CCCTTCCAGG TGGCTGAGGT 800 

30 ATTCACCGGC AAGCCCGGAC GTTTCGTCGA GCTGCAGGAC ACCATCAGCG 850 

GCGTCAAGGA GATTTTGGAC GGCGAGTGCG ACGACATG 888 



35 2) INFORMATION FOR SEQ ID NO: 857 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 884 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bovis 

(B) STRAIN: Suarez-3 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 857 



TGATTATGGA ATTGATCAAC AATGTCGCCA AGAAACACGG TGGGTTCTCC 50 

GTGTTCGCTG GTGTTGGTGA ACGTACGAGG GAAGGTAACG AACTGTACCA 100 

TGAAATGATG GAAACGGGTG TCATCAAGCG CCGTCAACTG GAAGACGGAA 150 

CATTTGACTT CTCGGGCTCT AAAGCTGCTT TGGTGTACGG ACAAATGAAC 200 

55 GAACCACCAG GTGCTAGAGC CCGTGTTGCA CTCACGGGAT TGACCGTTGC 250 

CGAGTATTTC CGTGATGAAG AGGGGCAGGA TGTGCTACTC TTCATCGATA 3 00 

ACATCTACCG TTTCACCCAG GCCGGTTCCG AAGTGAGTGC GCTGTTAGGA 350 

AGAATTCCAT CCGCCGTGGG TTATCAACCT ACATTGGCCA CTGATCTCGG 400 

AGCACTCCAG GAACGCATTA CTACAACCAA CAAGGGTTCG ATTACATCAG 450 

60 TCCAGGCAGT ATACGTCCCA GCCGATGATA TCACTGATCC CGCTCCAGCT 500 

480 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



ACCACTTTCT CGCACTTGGA 
GGAGTTGGGT ATTTACCCTG 
TGCTGTCGGC CAACATTGTA 
GTACAGAAAA TTTTACAGGA 
TCTGGGTATG GACGAGCTGT 
CCCGTAAGGT ACAGCGTTTC 
TTCACCGGCA AGCCTGGGAG 
TGTCAAGGAA ATCTGGAAGG 



TGCCACTACA 
CGGTCGACCC 
GGACAGGAAC 
CTACAAATCA 
CTGAGCAGGA 
CTGTCTCAGC 
GTTCGTTGAA 
TGAGTGTGAC 



GTGCTTTCTC GTTCAATTGC 
GCTTGACTCA ACGTCACGTA 
AGTACGATGC CGCACGTGGT 
CTGCAGGATA TCATTGCCAT 
CAAGTTCGTT GTAGCACGCG 
CGTTCCAAGT GGCTGAGGTG 
CTACAGGATA CCATCAGCGG 
GATA 



550 
600 
650 
700 
750 
800 
850 
884 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 858 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 871 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia microtti 

(B) STRAIN: Persing-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 858 

TGGAACTGAT TAATAATGTG GCCAAAAAGC ATGGCGGTTA CTCTGTTTTT 50 

GCAGGTGTAG GTGAAAGGAC GAGGGAGGGT AATGAATTGT ACCATGAAAT 100 

GATGGAGACA GGTGTTATAA AGAAAAAGGC ACTAGGTGGT GGGAAGTTTG 150 * 

ATTTCAGTGG ATCTAAAGCA GCGCTGGTCT ATGGACAAAT GAACGAGCCA 200 

CCTGGGGCCC GTGCTAGAGT GGCACTAACT GGATTAACAG TCGCAGAATA 250 

TTTCCGTGAC GAACAAGGAC AAGACGTGTT GTTGTTTATT GATAATATTT 300 

ACCGATTTAC TCAGGCAGGG TCTGAGGTTT CAGCCTTGCT AGGCCGTATA 350 

CCTTCAGCTG TGGGATACCA GCCTACATTG GCAACAGATC TTGGCTGTTT 400 

ACAAGAACGA ATTACTACGA CCAAATCTGG TTCAATCACC AGTGTACAAG 450 

CTGTGTATGT GCCAGCAGAT GATATTACTG ATCCAGCGCC TGCCACAACT 500 

TTTACTCACT TGGACGCTAC TACTGTACTT AGCAGGCCAA TTGCTGAACT 550 

CGGTATTTAT CCAGCGGTAG ACCCGTTGGA TTCAACAAGC CGTATGCTAA 600 

GCGCGAACAT TGTGGGAAAT GAACACTATA GTGTAGCCCG TTCCGTGCAG 650 

AAGATACTGC AAGATTACAA ATCGCTTCAG GACATTATTG CCATTTTGGG 700 

TATGGATGAA CTGTCGGAAC AAGACAAAAA TATAGTAGCC CGAGCAAGGA 750 

AGATGCAAAG GTTCTTATCA CAGCCATTCC AAGTGGCGGA AGTTTTTACT 800 

GGTAAACCGG GAAGATTTGT GGAATTGGAA GATACAATTG CCGGGGCACG 850 

AGATATAATT GCGGGTAATT G 871 



2) INFORMATION FOR SEQ ID NO: 859 

* 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1255 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania guyanensis 

481 
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(B) STRAIN: ATCC 50126 



PCT/CA00/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 859 

5 TGACGGCGCT GGACGTGACG GAGGACCTCG GCCGCGATGA GCCGCTGACG 50 

CTGGAGATCG TGCAGCACTT GGATGCGAAC ACCGGCCGCT GCATTGCGAT 100 

GCAGACGACG GACCTGCTGA AGCTGAAGTC GAAGGTTGTG TCGACCGGCG 150 

GCAACATCTC CGTGCCGGTG GGCCGCGAGA CACTGGGCCG CATCTTCAAC 2 00 

GTGCTGGGCG ACGCGATTGA CCACCGCGGC CCCGTGTGCG AGAAGATGCG 2 50 

10 CATGGCGATC CACGCCGAGG CGCCGAAGCT GGCGGACCAG GCTGCGGAGG 3 00 

ACACGATCCT GACGACCGGC ATCAAGGTGA TCGACCTGAT TCTGCCCTAC 350 

TGCAAGGGCG GCAAGATCGG CCTGTTCGGC GGTGCCGGTG TGGGCAAGAC 400 

TGTGATCATC ATGGAGCTGA TCAACAACGT CGCGAAGGGG CACGGCGGCT 4 50 

TCTCCGTGTT CGCCGGCGTT GGCGAGCGCA CGCGCGAGGG CACGGACCTG 500 

15 TACCTGGAGA TGATGCAGTC AAAGGTGATT GACCTGAAGG GCGAGTCGAA 550 

GTGCGTGCTT GTGTACGGGC AGATGAACGA GCCCCCGGGT GCGCGCGCGC 600 
GCGTTGCGCA GTCTGCGCTG ACGATGGCCG AGTACTTCCG CGACGTGGAG ' 650 

GGCCAGAACG TGCTGCTGTT CATCGACAAC ATCTTCCGCT TCACGCAGGC 700 

GAACTCCGAG GTGTCTGCGC TGCTGGGCCG CATCCCGGCC GCCGTGGGTT 750 

20 ACCAGCCGAC GCTTGCGGAG GATCTTGGCA TGCTGCAGGA GCGCATTACG 800 

TCGACGACGA AGGGATCGAT TACGTCTGTG CAGGCTGTGT ACGTGCCTGC 850 

GGATGATATC ACGGACCCCG CGCCCGCGAC GACGTTCTCG CACCTGGACG 900 

CGACGACTGT GCTGGACCGC GCGGTGGCGG AGTCGGGCAT CTACCCTGCC 950 

GTGAACCCGC TGGAGTGCGC GTCGCGCATC ATGGACCCCG ATGTGATCGA 1000 

25 CGTGGACCAT TACAACGTTG CACAGGATAT CGTCCAGATG CTGACCAAGT 1050 

ACAAGGAGCT GCAGGACATC ATTGCGGTGC TTGGCATCGA CGAGCTGAGC 1100 

GAGGAGGACA AGGTCGTGGT GGACCGCGCG CGCAAGGTGA CGCGGTTCCT 1150 

GTCGCAGCCG TTCCAGGTTG CGGAGGTGTT CACCGGCATG ACGGGCCACT 12 00 

ACGTGCAGCT GAGCGACACG GTGGAGTCGT TCTCTGGCCT GCTGATGGGG 12 50 

30 * TCGTA 1255 



2) INFORMATION FOR SEQ ID NO: 860 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1222 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS : Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Leishmania mexicana 

(B) STRAIN: ATCC 50156 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 860 

50 CTCGGAGGGC GTCCCGCCCG TGCTGACGGC GCTGGATGTG ACGGAGGACC 50 

TTGGCCGCGA TGAGCCGCTG ACGCTGGAGA TCGTGCAGCA CCTGGACGCG 100 

AACACCGGCC GCTGCATTGC GATGCAGACG ACGGACCTGC TGAAGCTGAA 150 

GTCGAAGGTT GTGTCGACCG GCGGCAACAT CTCTGTGCCG GTGGGCCGTG 200 

AGACGCTGGG CCGCATCTTC AACGTGCTGG GCGACGCGAT CGACCAGCGC 250 

55 GGCCCCGTGG GTGAGAAGAT GCGCATGGCG ATCCACGCCG AGGCCCCGAA 300 

GCTGGCGGAT CAGGCCGCGG AGGACACGAT CCTGACGACC GGCATCAAGG 350 

TGATCGACCT GATTCTGCCC TACTGCAAGG GTGGCAAGAT CGGCCTGTTT 400 

GGCGGCGCCG GTGTGGGCAA GACCGTGATC ATCATGGAGC TGATTAACAA 450 

CGTCGCGAAG GGCCACGGTG GTTTCTCGGT GTTTGCCGGC GTTGGCGAGC 500 

60 GCACGCGCGA GGGCACGGAC CTGTACCTGG AGATGATGCA GTCGAAGGTG 550 

482 
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ATTGACCTGA AGGGCGAGTC GAAGTGCGTG CTTGTGTACG GGCAGAT&AA * 600 

CGAGCCCCCG GGTGCGCGCG CGCGCGTTGC GCAGTCTGCG CTGACGATGG 650 

CGGAGTACTT CCGAGACGTG GAGGGCCAGA ATGTGCTGCT GTTCATCGAC 700 

AACATCTTCC GCTTCACGCA GGCGAACTCC GAGGTCTCTG CGCTGCTGGG 750 

5 CCGCATTCCG GCCGCCGTGG GCTACCAGCC GACGCTTGCG GAGGATCTTG 800 

GTATGCTGCA GGAGCGCATC ACGTCGACGA CGAAGGGGTC GATCACGTCC 850 

GTGCAGGCCG TGTACGTGCC TGCGGATGAT ATCACGGATC CGGCGCCCGC 900 

GACGACGTTC TCGCACCTGG ACGCGACGAC TGTGCTGGAC CGCGCGGTGG 950 

CGGAGTCGGG GATCTACCCT GCCGTGAACC CGCTGGAGTG CGCGTCGCGT 1000 

10 ATCATGGACC CCGATGTGAT CGACGTGGAC CACTACAACG TTGCGCAGGA 1050 

TATCGTGCAG ATGCTGACCA AGTACAAGGA GCTGCAGGAT ATCATTGCGG 110 0 

TGCTTGGTAT CGACGAGCTG AGCGAGGAGG ACAAGGTCGT GGTGGACCGC 1150 

GCGCGCAAGG TGACCCGGTT CCTGTCGCAG CCGTTCCAGG TTGCGGAGGT 120 0 

GTTCACGGGC ATGACGGGCC AC 1222 

15 



2) INFORMATION FOR SEQ ID NO: 861 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 
' 30 (B) STRAIN: ATCC 50129 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 861 

CCGTGCTGAC GGCGCTGGAT GTGACGGAGG ACCTTGGCCG CGATGAGCCG 50 

35 CTGACGCTGG AGATCGTGCA GCACTTGGAC GCGAACACCG GCCGCTGCAT 100 

TGCGATGCAG ACGACGGACC TGCTGAAGCT GAAGTCGAAG GTTGTGTCGA 150 

CCGGCGGCAA CATCTCTGTG CCGGTGGGCC GTGAGACGCT GGGCCGCATC 2 00 

TTCAACGTTC TGGGCGACGC GATCGACCAG CGCGGCCCCG TGGGCGAGAA 250 

GATGCGCATG GCGATCCACG CCGAGGCCCC GAAGCTGGCG GATCAGGCCG 3 00 

40 CGGAGGACAC GATCCTGACG ACCGGCATCA AGGTGATCGA CCTGATTCTG 3 50 

CCCTACTGCA AGGGTGGCAA GATCGGCCTG TTCGGCGGTG CCGGTGTGGG 4 00 

CAAGACTGTG ATCATCATGG AGCTGATCAA CAACGTCGCG AAGGGCCACG 450 

GTGGTTTCTC CGTGTTTGCC GGCGTTGGCG AGCGCACGCG CGAGGGCACG 500 

GACCTGTACC TGGAGATGAT GCAGTCGAAG GTGATTGACC TGAAGGGCGA 550 

45 GTCGAAGTGC GTGCTTGTGT ACGGGCAGAT GAACGAGCCC CCGGGTGCGC 600 

GCGCGCGCGT TGCGCAGTCT GCGCTGACGA TGGCGGAGTA CTTCCGCGAC 650 

GTGGAGGGCC AGAACGTGCT GCTGTTCATC GACAACATCT TCCGCTTCAC 700 

GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT GGGCCGCATT CCGGCCGCCG 750 

TGGGCTACCA GCCGACGCTT GCGGAGGATC TTGGTATGCT GCAGGAGCGC 800 

50 ATCACGTCGA CAACGAAGGG GTCGATCACG TCCGTGCAGG CCGTGTACGT 850 

GCCAGCGGAT GATATCACGG ATCCCGCGCC CGCGACGACG TTCTCGCACC 900 

TGGACGCGAC GACTGTGCTG GACCGCGCGG TGGCGGAGTC GGGCATCTAC 950 

CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG CGTATCATGG ACCCCGATGT 10 00 

GATCGATGTG GACCACTACA ACGTTGCGCA GGATATCGTG CAGATGCTGA 1050 

55 CCAAGTACAA GGAGCTGCAG GATATCATTG CGGTGCTTGG CATCGACGAG 1100 

CTGAGCGAGG AAGACAAGGT TGTTGTGGAC CGCGCGCGCA AGGTGACCCG 1150 

GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA GGTGTTCACG GGCATGACGG 1200 

GCCACTACGT GCAGCTGGTC GACACGGTGG AGTCGTTCTC TGGCCT 1246 

60 

483 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



2) INFORMATION FOR SEQ ID NO: 862 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 1265 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 30815 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 862 



GGCGTGCCGC CCGTGCTGAC GGCGCTGGAT GTGACGGAGG ACCTTGGCCG 50 

CGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAC GCGAACACGG 100 

2 0 GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT GAAGTCGAAG 150 

GTCGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC GTGAGACGCT 200 

GGGCCGCATC TTCAAYGTTC TGGGCGACGC GATCGACCAG CGCGGCCCCG 250 

TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC GAAGCTGGCG 300 

GATCAGGCCG CGGAGGACAC GATCCTGACG ACCGGCATCA AGGTGATCGA 350 

25 CCTGATTCTG CCCTACTGCA AGGGTGGCAA GATCGGCCTG TTCGGCGGTG 400 

CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA CAACGTCGCG 450 

AAGGGCCACG GCGGTTTCTC CGTGTTTGCC GGCGTTGGCG AGCGCACGCG 500 

CGAGGGCACG GACCTGTACC TGGAGATGAT GCAGTCGAAG GTGATTGACC 550 

TGAAGGGCGA GTCGAAGTGY GTGCTTGTGT ATGGGCAGAT GAACGAGCCC 600 

3 0 CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA TGGCGGAGTA 650 

CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC GACAACATCT 700 

TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT GGGCCGCATT 750; 

CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCGGAGGATC TTGGTATGCT 800^ 

GCAGGAGCGC ATCACGTCGA CAACGAAGGG GTCGATCACG TCCGTGCAGG 850 

35 CCGTGTACGT GCCAGCGGAT GATATCACGG ATCCCGCGCC CGCGACGACG 900 

TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGCGCGG TGGCGGAGTC 950 

GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG CGTATCATGG 1000 

ACCCTGATGT GATCGATGTG GACCACTACA ACGTTGCGCA GGATATCGTG 1050 

CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG CGGTGCTTGG 1100 

40 CATCGACGAG CTGAGCGAGG AAGACAAGGT TGTTGTGGAC CGCGCGCGCA 1150 

AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA GGTGTTCACG 1200 

GGCATGACGG GCCACTACGT GCAGCTGGTC GACACGGTGG AGTCGTTCTC 1250 

TGGCCTGCTG ATGGG 1265 

45 

2) INFORMATION FOR SEQ ID NO: 863 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 1191 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bordetella pertussis 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 863 

484 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



ATGGCAAAAG GCAAGTTTGA ACGTACCAAG CCGCACGTGA ACGTGGGTAC 50' 

GATTGGTCAC GTTGACCACG GCAAAACGAC GTTGACGGCG GCGATCACGA 100 

CGGTGCTGTC GAACAAGTTC GGCGGCGAGG CTCGCGGCTA CGACCAGATT 150 

5 GACGCGGCGC CGGAAGAGAA GGCGCGTGGG ATCACGATCA ACACCTCGCA 200 

CGTTGAGTAC GAGACGGAGA CGCGTCACTA CGCGCACGTT GATTGCCCGG 250 

GTCACGCTGA CTACGTGAAG AACATGATCA CGGGTGCTGC GCAGATGGAC 300 

GGCGCGATCC TGGTGGTGTC GGCCGCAGAC GGCCCGATGC CGCAGACGCG 350 

CGAGCACATT TTGCTGTCGC GCCAGGTTGG CGTGCCGTAC ATCATCGTGT 400 

10 TCCTGAACAA GGCGGACATG GTTGATGACG CGGAGCTGCT CGAGCTGGTG 450 

GAGATGGAAG TCCGCGAACT GCTGAGCAAG TACGATTTCC CGGGCGATGA 500 

CACGCCGATC GTGAATGGTG CGGCCAAGCT GGCGCTGGAA AGCGACAACG 550 

GCGACCTGGG CGAGCAGGCG ATTCTGTCGC TGGCGCAAGC GCTGGACACG 600 

TACATTCCGA CGCCGGAGCG CGCGGTCGAC GGTGCGTTCC TGATGCCGGT 650 

15 GGAAGACGTG TTCTCGATCT CGGGCCGTGG CACGGTGGTG ACTGGCCGTA 700 

TCGAGCGCGG CGTGGTGAAG GTTGGCGAGG AAATCGAAAT CGTGGGCATC 750 

AAGCCGACGG TGAAGACGAC CTGCACGGGC GTGGAGATGT TCCGCAAGCT 800 

GCTGGACCAG GGCCAGGCGG GCGACAACGT GGGTATCTTG CTGCGCGGCA 850 

CCAAGCGTGA AGACGTCGAG CGTGGCCAGG TGCTGGCCAA GCCGGGTTCG 900 

20 ATCAACCCGC ACACGGACTT CACGGCCGAG GTGTACATTC TGTCCAAGGA 950 

AGAGGGTGGC CGTCACACGC CGTTCTTCAA CGGCTATCGT CCGCAGTTCT 1000 

ACTTCCGCAC GACGGACGTG ACCGGCACGA TCGACCTGCC GGCGGACAAG 1050 

GAAATGGTGC TGCCGGGCGA CAACGTGTCG ATGACCGTCA AGCTGCTGGC 1100 

CCCGATCGCC ATGGAAGAAG GTCTGCGTTT CGCCATCCGT GAAGGCGGTC 1150 

25 GTACCGTCGG TGCCGGCGTC GTCGCCAAGA TCATCAAGTA A 1191 



2) INFORMATION FOR SEQ ID NO: 864 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1350 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double, 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Trypanosoma brucei 

(B) STRAIN: LVH/ 75/ US AMRU - K/ 1 8 

(C) ACCESSION NUMBER: extracted from U10562 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 864 



45 



ATGGGAAAGG AAAAGGTGCA CATGAATCTT GTGGTGGTGG GCCACGTCGA 50 

TGCCGGTAAA TCCACTGCAA CGGGTCACTT GATCTACAAG TGCGGTGGTA 100 

TTGACAAACG TACGATCGAG AAGTTCGAGA AAGAAGCTGC CGACATTGGT 150 

AAGGCCTCAT TCAAGTACGC ATGGGTGCTG GACAAGCTGA AGGCTGAGCG 200 

50 CGAACGTGGT ATCACGATCG ACATTGCACT GTGGAAATTC GAGTCACCCA 250 

AGTCTGTCTT CACTATTATT GATGCTCCTG GGCACCGTGA CTTCATCAAG 300 

AACATGATCA CCGGCACATC GCAAGCCGAC GCAGCCATCC TCATCATTGC 350 

CTCTGCGCAG GGTGAGTTCG AGGCTGGTAT CTCCAAGGAT GGACAGACCC 400 

GCGAGCACGC GTTGCTGGCC TTCACTTTGG GTGTGAAGCA GATGGTTGTG 450 

55 TGCTGCAACA AAATGGACGA CAAGACTGTG AACTACGGAC AGGAGCGGTA 500 

TGACGAGATT GTGAAGGAGG TGTCTGCTTA CATCAAGAAG GTTGGGTACA 550 

ACGTGGAGAA GGTGCGCTTC GTCCCCATCT CCGGATGGCA GGGCGACAAC 600 

ATGATTGAGA AATCCGAGAA GATGCCATGG TACAAGGGTC CAACGCTCCT 650 

GGAGGCACTA GACATGCTGG AGCCACCAGT GCGTCCGAGC GACAAGCCCC 700 

60 TGCGTCTGCC ACTGCAGACG TGTACAAAGA TCGGTGGTAT TGGCACCGTG 750 

485 
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CCCGTTGGTC GTGTGGAGAC CGGCGTGATG AAGCCTGGTG ATGTGGTGAC 800/ 

GTTTGCCCCC GCCAACGTGA CGACCGAGGT GAAATCGATC GAGATGCACC 850 

ACGAGCAGCT CGCTGAGGCG ACCCCCGGTG ACAACGTCGG J CTTTAACGTG 900 

AAGAACGTTT CTGTAAAGGA CATCCGCCGT GGCAACGTCT GCGGTAACAC 950 

5 CAAGAACGAC CCCCCAAAGG AGGCCGCCGA CTTCACGGCA CAGGTGATCA 1000 

TCCTGAACCA CCCCGGACAG ATTGGAAACG GTTATGCGCC CGTGCTGGAC 1050 

TGCCACACAT CGCACATTGC CTGCAAGTTC GCGGAGATCG AGTCGAAGAT 1100 

CGACCGTCGC TCTGGCAAGG AGCTGGAGAA GGCTCCCAAG TCGATCAAGT 1150 

CTGGCGACGC CGCGATCGTG CGCATGGTGC CGCAGAAGCC TATGTGCGTG 1200 

10 GAGGTCTTCA ACGACTACGC GCCACTCGGC CGCTTTGCCG TGCGTGACAT 1250 

GCGCCAGACC GTCGCTGTCG GTATCATCAA GGCCGTGACC AAGAAGGACG 13 00 

GTTCTGGTGG TAAGGTGACG AAGGCTGCGG TGAAGGCTTC GAAGAAATAA 1350 



2) INFORMATION FOR SEQ ID NO: 865 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1052 bases 
2 0 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 865 



AAGCTCAAGG CTGAGAGAGA AAGAGGTATT ACCATCGATA TTGCTTTATG 50 

GCAATTCGAA ACCCCAAAAT ACCACTACAC TGTCATTGAT GCCCCAGGTC 100 

ACAGAGATTT CATCAAGAAT ATGATTACTG GTACCTCTCA AGCTGATGTT 150 

GCTTTATTGG TTGTCCCAGC CGATCGTTTC GAAGGTGCCT TCTCCAAGGA 200 

35 AGGTCAAACC AGAGAACATG CTTTATTGGC CTTCACTTTG GGTGTCAGAC 250 

AAATGATTGT CGGTATTAAC AAGATGGATA C CTGTGAATA CAAGCAATCT 300 

CGTTTTGATG AAATCTTCAA CGAAGTTGAT GGTTACCTCA AGAAGGTTGG 350 

TTACAACACC GAGAAGATCC CATTCGTTGC CATTTCTGGT TTCGTTGGTG 400 

ATAATATGGT TGAGAGATCT GACAAGATGC CATGGTATAA GGGTAAGACC 450 

40 TTAGTCGAAG CCCTCGACAC TATGGAACCA CCAAAGAGAC CAACTGACAA 500 

GCCACTCCGT CTCCCATTAC AAGATGTTTA CAAGATAGGT GGTGTAGGTA 550 

CTGTCCCAGT CGGTCGTGTT GAGACTGGTA TCATCAGACC AGGTATGAAT 600 

GTTACCTTCG CTCCAGCTGG TGTTACCACT GAAGTTAAGT CAGTAGAAAT 650 

GCACCATGAG CAGATGCCAG AGGCCGTCCC AGGTGACAAC GTTGGTTTCA 700 

45 ATGTTAAGAA CGTCTCCATC AAGGATATCA AGAGAGGTTT CGTTGCTTCT 750 

GATGCCAAGA ATGACCCAGC TAAGGGCTGT GAAGACTTCA CTGCTCAAGT 800 

TATCGTCCTC AACCACCCAG GTGAAATCAA GAACGGTTAC TCTCCAGTCG 850 

TTGACTGTCA CACCGCTCAC ATTTCCTGCA AATTCCAGAC TATCACTGCT 900 

AAGATGGACA AGAGATCTGG TAAGGTTTTG GAAGAAAACC CAAAGCTTAT 950 

50 CAAGTCTGGT GATGCTGCTT TGGTTGTTAT GCAACCTTTG AAGCCACTTT 1000 

GTGTTGAGGC CTTCACTGAC TACCCACCTC TAGGTCGTTT CGCTGTCCGT 1050 

GA 1052 



2) INFORMATION FOR SEQ ID NO: 866 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 83 7 bases 
60 (B) TYPE: Nucleic acid 

486 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: ATCC 35552 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 866 

CAATGAAGTT CCAGAAATTA ACAATGCCTT AGTCGTAGAC GTTGAAAGAG 50 

ATGAAGGTAC AGTATCTCTT ACATTAGAAG TGGCATTACA ACTTGGCGAT 100 

GATGTCGTAC GTACAATTGC AATGGATTCT ACTGATGGTG TTAAACGTGG 150 

15 TACAGAAGTT CGAGATAGCG GAGATAGCAT CAGTGTTCCA GTTGGTGATG 200 

CTACGTTAGG ACGTGTGTTT AATGTTCTTG GTGATACAAT TGACTTAGAC 250 

GAGAAGCTTG ATACTTCTGT CAAACGTGAT CCAATTCATA GAGAAGCACC 300 

TGCATTCGAT CAATTATCAA CAAAAGTTGA AATCTTAGAA ACAGGTATTA 350 

AAGTAATTGA TTTACTTGCA CCATATATTA AAGGTGGTAA AATCGGTTTA 400 

2 0 TTCGGTGGCG CTGGTGTAGG TAAAACAGTA TTAATTCAAG AATTAATTAA 450 

TAATATAGCT CAAGAACATG GTGGTATTTC AGTATTTGCC GGCGTAGGTG 500 

AACGTACGCG TGAAGGTAAT GACTTATACT ACGAAATGAG TGATAGTGGT 550 

GTTATTAAGA AAACAGCTAT GGTCTTCGGA CAAATGAATG AGCCACCTGG 600 

TGCGCGTATG CGTGTTGCTT TATCAGGCTT AACAATGGCT GAACACTTCC 650 

25 GTGATGTACA AGGACAAGAT GTTTTACTAT TTATTGATAA CATATTCAGA 700 

TTTACGCAAG CTGGTTCAGA AGTATCAGCA CTATTAGGTC GTATGCCATC 750 

AGCCGTTGGT TATCAACCTA CCCTTGCTAC TGAAATGGGT CAATTACAAG 800 

AACGTATTAC ATCAACAACT AAAGGATCTG TAACGTC 837 

30 

2) INFORMATION FOR SEQ ID NO: 867 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zoogloea ramigera 

(B) STRAIN: ATCC 25935 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 867 



45 



AAGGTATTCG ATGCCTTGAA AATGGAAGGC TCCGAGCTGA CCCTGGAAGT 50 

ACAACAGCAG CTGGGCGACG GCATTGTCCG TACCATTGCA CTGGGTACCT 100 

50 CCGACGGCCT GCGTCGCGGC ATGATGATCC AGAACACCGG CAAAC CTATC 150 

ATGGTGCCAG TCGGTAAAGC AACCCTGGGT CGCATCATGG ACGTGCTGGG 200 

TAACCCGATC GACGAATGCG GCGCGGTCGC TCACGACCAG ATCGCTTCGA 250 

TCCACCGCGC TCCTCCTGCG TACGACGAAC TGTCGCCATC GCAAGATCTG 300 

CTGGAAACCG GCATTAAAGT TATTGACCTG GTGTGCCCGT TCGCCAAGGG 350 

55 CGGTAAAGTC GGTCTGTTCG GCGGTGCAGG TGTGGGCAAG ACCGTGAACA 400 

TGATGGAACT GATCAACAAC ATCGCCAAAG CACACTCGGG TCTGTCCGTG 450 

TTTGCCGGTG TGGGTGAGCG TACCCGTGAA GGTAACGACT TCTACCACGA 500 

GATGGCTGAC GCCAAAGTGG TCGATCTGGA AAATCCAGAG AACTCCAAGG 550 

TTGCGATGGT CTACGGTCAG ATGAATGAAC CACCAGGCAA CCGTCTGCGC 600 

60 GTGGCGCTGA CCGGTCTGAC CATGGCTGAA GCATTCCGTG ACGAAGGCAA 650 
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AGACGTTCTG TTCTTCGTGG 

CCGAAGTATC GGCACTGCTG 

CCTACGCTGG CCGAAGAAAT 

CAAGACCGGT TCGATCAC 



ACAACATCTA CCTgCTTCACC 
GGCCGTATGC CATCGGCTGT 
GGGTCGCCTG CAAGAGCGCA 



PCT/CA00/01150 

CTGGCCG(?tfA ~ "70 0~ 
GGGTTACCAG 750 
TCACTTCGAC 800 

818 



2) INFORMATION FOR SEQ ID NO: 868 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: ATCC 43867 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 868 

CTATCTTAGT AGTATCTGCT GCTGATGGCC CAATGCCACA AACTCGTGAA 50 

CACATTCTTT TATCACGTAA CGTTGGTGTT CCAGCATTAG TTGTATTCTT 100 

AAACAAAGTT GACATGGTTG ACGATGAAGA ATTATTAGAA TTAGTAGAAA 150 

TGGAAGTTCG TGACTTATTA AGCGAATATG ACTTCCCAGG TGACGATGTA 200 

CCTGTAATCT CTGGTTCTGC ATTAAAAGCT. TTAGAAGGCG ACGCTGACTA 250 

TGAGCAAAAA ATCTTAGACT TAATGCAAGC TGTTGATGAC TTCATTCCAA 300 

CACCAGAACG TGATTCTGAC AAACCATTCA TGATGCCAGT TGAGGACGTA 350 

TTCTCAATCA CTGGTCGTGG TACTGTTGCT ACAGGCCGTG TTGAACGTGG 400 

TCAAATCAAA GTCGGTGAAG AAATCGAAAT CATCGGTATG CAAGAAGAAT 450 

CAAGCAAAAC AACTGTTACT GGTGTAGAAA 1 TGTTCCGTAA ATTATTAGAC 500 

TACGCTGAAG CTGGTGACAA CATTGGTGCA TTATTACGTG GTGTTTCACG 550 

TGATGACGTA CAACGTGGTC AAGTTTTAGC TGCTCCTGGT ACTATTACAC 600 

CACATACAAA ATTCAAAGCG GATGTTTACG TTTTATCTAA AGATGAAGGT 650 

GGTCGTCATA CACCATTCTT CACTAACTAC CGCCCACAAT TCTATTTCCG 700 

TACTACTGAC GTAACTGGTG TTGTTAACTT ACCAGAAGGT ACTGAAATGG 750 

TTATGCCTGG CGATAACGTT GAAATGGA 778 



2) INFORMATION FOR SEQ ID NO: 869 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 640 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseli flaws 

(B) STRAIN: R689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 869 

TGGTCCTATG CCTCAAACAC GTGAACACAT CTTGTTATCA CGTAACGTTG 50 
GTGTACCATA CATCGTTGTT TTCTTAAACA AAATGGATAT GGTTGATGAC A 100 
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GAAGAATTAC 
ATATGACTTC 
AAGCTCTTGA 
GCTGCAGTTG 
ATTCATGATG 
TTGCTACAGG 
GAAATCGTTG 
TGAAATGTTC 
GTGCATTGCT 
TTGGCTAAAG 
TTACGTTTTA 



TAGAATTAGT 
CCAGGCGACG 
AGGCGATGCT 
ACGAATACGT 
CCAGTCGAAG 
CCGTGTTGAA 
GTATTGCTGA 
CGTAAATTGT 
ACGTGGTGTT 
CTGGTACAAT 
ACAAAAGAAG 



TGAAATGGAA 
ATGTTCCTGT 
TCATACGAAG 
TCCAACTCCA 
ACGTATTCTC 
CGTGGACAAG 
AGAAACTGCT 
TAGACTATGC 
GCTCGTGAAG 
CACACCTCAT 
AAGGTGGACG 



GTTCGTGACT TATTGT6AGA 
AATCGCTGGT TCTGCTTTGA 
AAAAAATCAT GGAATTAATG 
GAACGTGACA CTGACAAACC 
AATCACTGGA CGTGGTACTG 
TTCGCGTTGG TGACGAAGTT 
AAAACAACTG TAACTGGTGT 
TGAAGCAGGG GATAACATTG 
ACATCCAACG TGGACAAGTA 
ACAAAATTTA AAGCTGAAGT 
TCACACTCCA 



150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
640 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 870 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 644 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

<ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseli flaws 

(B) STRAIN: R754 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 870 

GTCCTATGCC TCAAACACGT GAACACATCT TGTTATCACG TAACGTTGGT 50 

GTACCATACA TCGTTGTTTT CTTAAACAAA ATGGATATGG TTGATGACGA 100 

AGAATTACTA GAATTAGTTG AAATGGAAGT TCGTGACTTA TTGTCAGAAT 150 

ATGACTTCCC AGGCGACGAT GTTCCTGTAA TCGCTGGTTC TGCTTTGAAA 2 00 

GCTCTTGAAG GCGATGCTTC ATACGAAGAA AAAATCATGG AATTAATGGC 2 50 

TGCAGTTGAC GAATACGTTC CAACTCCAGA ACGTGACACT GACAAACCAT 3 00 

TCATGATGCC AGTCGAAGAC GTATTCTCAA TCACTGGACG TGGTACTGTT 3 50 

GCTACAGGCC GTGTTGAACG TGGACAAGTT CGCGTTGGTG ACGAAGTTGA 4 00 

AATCGTTGGT ATTGCTGAAG AAACTGCTAA AACAACTGTA ACTGGTGTTG 450 

AAATGTTCCG TAAATTGTTA GACTATGCTG AAGCAGGGGA TAACATTGGT 500 

GCATTGCTAC GTGGTGTTGC TCGTGAAGAC ATCCAACGTG GACAAGTATT 550 

GGCTAAAGCT GGTACAATCA CACCTCATAC AAAATTTAAA GCTGAAGTTT 600 

ACGTTTTAAC AAAAGAAGAA GGTGGACGTC ACACACCATT CTTC 644 



2) INFORMATION FOR SEQ ID NO: 871 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 7 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 



(B) STRAIN: R758 
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TCCTATGCCT CAAACACGTG AACACATCTT GTTATCACGT AACGTTGGTG 50 

TACCATACAT CGTTGTTTTC TTAAACAAAA TGGATATGGT TGATGACGAA 100 

5 GAATTACTAG AATTAGTTGA AATGGAAGTT CGTGACTTAT TGTCAGAATA 150 

TGACTTCCCA GGCGACGATG TTCCTGTAAT CGCTGGTTCT GCTTTGAAAG 200 

CTCTTGAAGG CGATGCTTCA TACGAAGAAA AAATCATGGA ATTAATGGCT 250 

GCAGTTGACG AATACGTTCC AACTCCAGAA CGTGACACTG ACAAACCATT 300 

CATGATGCCA GTCGAAGACG TATTCTCAAT CACTGGACGT GGTACTGTTG 350 

10 CTACAGGCCG TGTTGAACGT GGACAAGTTC GCGTTGGTGA CGAAGTTGAA 400 

ATCGTTGGTA TTGCTGAAGA AACTGCTAAA ACAACTGTAA CTGGTGTTGA 450 

AATGTTCCGT AAATTGTTAG ACTATGCTGA AGCAGGGGAT AACATTGGTG 500 

CATTGCTACG TGGGGTTGCT CGTGAAGACA TCCAACGTGG ACAAGTATTA 550 

GCTAAAGCTG GTACAATCAC ACCTCATACA AAATTTAAAG C TGAAGTTTA 600 

15 CGTTTTAACA AAAGAAGAAG GTGGACGTCA CACTCCA 637 



2) INFORMATION FOR SEQ ID NO: 872 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 643 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A)* ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R631 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 872 

T 

35 GTCCTATGCC TCAAACTCGT GAACACATCT TGTTATCACG TAACGTTGGC 50 

GTACCATACA TCGTTGTTTT CTTGAACAAA ATGGATATGG TTGATGACGA 100 

AGAATTGCTA GAATTAGTTG AAATGGAAGT TCGTGACCTA TTGTCTGAAT 150 

ATGACTTCCC AGGCGACGAT GTTCCTGTAA TCGCCGGTTC TGCTTTGAAA 200 

GCTCTTGAAG GAGATCCTTC ATACGAAGAA AAAATCATGG AATTGATGGC 250 

40 TGCAGTTGAC GAATACGTTC CAACTCCAGA ACGTGATACT GACAAACCAT 300 

TCATGATGCC AGTCGAAGAC GTATTCTCAA TCACTGGACG TGGTACTGTT 350 

GCTACAGGCC GTGTTGAACG TGGACAAGTT CGCGTTGGTG ATGAAGTAGA 400 

AATCGTTGGT ATTGCTGACG AAACTGCTAA AACAACTGTA ACAGGTGTTG 450 

AAATGTTCCG TAAATTGTTA GACTATGCTG AAGCAGGGGA TAACATTGGT 500 

45 GCATTGCTAC GTGGGGTTGC TCGTGAAGAC ATCCAACGTG GACAAGTATT 550 

GGCTAAAGCT GGTACAATCA CACCTCATAC AAAATTCAAA GCTGAAGTTT 600 

ATGTTTTGAC AAAAGAAGAA GGTGGACGTC ACACTCCATT CTT 643 



50 

2) INFORMATION FOR SEQ ID NO: 873 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 641 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 



(ii) MOLECULE TYPE: Genomic DNA 

490 

SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R691 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 873 

GTCCTATGCC TCAAACTCGT GAACACATCT TGTTATCACG TAACGTTGGC 50 

GTACCATACA TCGTTGTTTT CTTGAACAAA ATGGATATGG TTGATGACGA 100 

AGAATTGCTA GAATTAGTTG AAATGGAAGT TCGTGACCTA TTGTCTGAAT 150 

ATGACTTCCC AGGCGACGAT GTTCCTGTAA TCGCCGGTTC TGCTTTGAAA 200 

GCTCTTGAAG GAGATCCTTC ATACGAAGAA AAAATCATGG AATTGATGGC 250 

TGCAGTTGAC GAATACGTTC CAACTCCAGA ACGTGATACT GACAAACCAT 3 00 

TCATGATGCC AGTCGAAGAC GTATTCTCAA TCACTGGACG TGGTACTGTT 350 

GCTACAGGCC GTGTTGAACG TGGACAAGTT CGCGTTGGTG ATGAAGTAGA 4 00 

AATCGTTGGT ATTGCTGACG AAACTGCTAA AACAACTGTA ACAGGTGTTG 450 

AAATGTTCCG TAAATTGTTA GACTATGCTG AAGCAGGGGA TAACATTGGT 500 

GCATTGCTAC GTGGGGTTGC TCGTGAAGAC ATCCAACGTG GACAAGTATT 550 

GGCTAAAGCT GGTACAATCA CACCTCATAC AAAATTCAAA GCTGAAGTTT 600 

ATGTTTTGAC AAAAGAAGAA GGTGGACGTC ACACTCCATT C 641 



2) INFORMATION FOR SEQ ID NO: 874 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 681 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: LSPQ 2514 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 874 

ACCAGCATTA GTAGTATTCT TAAATAAAGT TGACATGGTT GACGATGAAG 50 

AATTATTAGA ATTAGTTGAA ATGGAAGTAC GTGACTTATT ATCTGAATAC 100 

GACTTCCCAG GTGACGATGT ACCTGTAATC GCTGGTTCAG CATTAAAAGC 150 

TTTAGAAGGC GATGCTCAAT ACGAAGAAAA AATCTTAGAA TTAATGCAAG 200 

CAGTTGATGA CTACATTCCA ACTCCAGAAC GTGATTCTGA CAAACCATTC 250 

ATGATGCCAG TTGAGGACGT ATTCTCAATC ACTGGTCGTG GTACTGTTGC 300 

TACAGGCCGT GTTGAACGTG GGCAAATCAA AGTTGGTGAA GAAGTTGAAA 350 

TCATTGGTAT CCATGACACT TCTAAAACAA CTGTTACTGG TGTAGAAATG 400 

TTCCGTAAAT TATTAGACTA CGCTGAAGCT GGTGACAACA TCGGTGCATT 450 

ATTACGTGGT GTTGCTCGTG AAGACGTACA ACGTGGTCAA GTATTAGCTG 500 

CTCCAGGTTC AATCACACCT CACACAAAAT TTAAAGCAGA CGTATACGTT 550 

TTATCTAAAG ACGAAGGTGG ACGTCACACT CCATTCTTCA CAAACTATCG 600 

TCCACAATTC TATTTCCGTA CTACTGACGT AACTGGTGTT GTTAACTTAC 650 

CAGAAGGTAC TGAAATGGTT ATGCCTGGCG A 681 



2) INFORMATION FOR SEQ ID NO: 875 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 675 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: R591 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 875 

ATTATCACGT AACGTTGGTG TACCAGCATT AGTTGTATTC TTAAACAAAG 50 

TTGACATGGT AGACGACGAA GAATTATTAG AATTAGTTGA AATGGAAGTT 100 

CGTGACTTAT TAAGCGAATA TGACTTCCCA GGTGACGATG TACCTGTAAT 150 

15 CGCTGGTTCT GCATTAAAAG CATTAGAAGG CGATGCTGAA TACGAACAAA 200 

AAATCTTAGA CTTAATGCAA GCAGTTGATG ATTACATTCC AACTCCAGAA 250 

CGTGATTCTG ACAAACCATT CATGATGCCA GTTGAGGACG TATTCTCAAT 300 

CACTGGTCGT GGTACTGTTG CTACAGGCCG TGTTGAACGT GGTCAAATCA 350 

AAGTTGGTGA AGAAGTTGAA ATCATCGGTA TGCACGAAAC TTCTAAAACA 400 

20 ACTGTTACTG GTGTAGAAAT GTTCCGTAAA .TTATTAGACT ACGCTGAAGC 450 

TGGTGACAAC ATCGGTGCTT TATTACGTGG TGTTGCACGT GAAGACGTAC 500 

AACGTGGTCA AGTATTAGCT GCTCCTGGTT CTATTACACC ACACACAAAA 550 

TTCAAAGCTG AAGTATACGT ATTATCTAAA GATGAAGGTG GACGTCACAC 600 

TCCATTCTTC ACTAACTATC GCCCACAATT CTATTTCCGT ACTACTGACG 650 

25 TAACTGGTGT TGTAAACTTA CCAGA 675 



2) INFORMATION FOR SEQ ID NO: 876 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 704 bases 

(B) ' TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: CSG 10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 876 

45 TCTTATTATC ACGTAACGTT GGTGTACCAG CATTAGTTGT ATTCTTAAAC 50 

AAAGTTGACA TGGTAGACGA CGAAGAATTA TTAGAATTAG TTGAAATGGA 100 

AGTTCGTGAC TTATTAAGCG AATATGACTT CCCAGGTGAC GATGTACCTG 150 

TAATCGCTGG TTCTGCATTA AAAGCATTAG AAGGCGATGC TGAATACGAA 200 

CAAAAAATCT TAGACTTAAT GCAAGCAGTT GATGATTACA TTCCAACTCC 250 

50 AGAACGTGAT TCTGACAAAC CATTCATGAT GCCAGTTGAG GACGTATTCT 300 

CAATCACTGG TCGTGGTACT GTTGCTACAG GCCGTGTTGA ACGTGGTCAA 350 

ATCAAAGTTG GTGAAGAAGT TGAAATCATC GGTATGCACG AAACTTCTAA 400 

AACAACTGTT ACTGGTGTAG AAATGTTCCG TAAATTATTA GACTACGCTG 450 

AAGCTGGTGA CAACATCGGT GCTTTATTAC GTGGTGTTGC ACGTGAAGAC 500 

55 GTACAACGTG GTCAAGTATT AGCTGCTCCT GGTTCTATTA CACCACACAC 550 

AAAATTCAAA GCTGAAGTAT ACGTATTATC TAAAGATGAA GGTGGACGTC 600 

ACACTCCATT CTTCACTAAC TATCGCCCAC AATTCTATTT CCGTACTACT 650 

GACGTAACTG GTGTTGTAAA CTTACCAGAA GGTACAGAAA TGGTTATGCC 700 

TGGC 704 



60 



492 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



2) INFORMATION FOR SEQ ID NO: 877 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 770 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: ATCC 35984 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 877 

TCTTAGTTGT ATCTGCTGCT GACGGTCCAA TGC CACAAAC TCGTGAACAC 50 

ATCTTATTAT CACGTAACGT TGGTGTACCA GCATTAGTTG TATTCTTAAA 100 

CAAAGTTGAC ATGGTAGACG ACGAAGAATT ATTAGAATTA GTTGAAATGG 150 

AAGTTCGTGA CTTATTAAGC GAATATGACT TCCCAGGTGA CGATGTACCT 200 

GTAATCGCTG GTTCTGCATT AAAAGCATTA GAAGGCGATG CTGAATACGA 250 

ACAAAAAATC TTAGACTTAA TGCAAGCAGT TGATGATTAC ATTCCAACTC 3 00 

CAGAACGTGA TTCTGACAAA CCATTCATGA TGCCAGTTGA GGACGTATTC 350 

TCAATCACTG GTCGTGGTAC TGTTGCTACA GGCCGTGTTG AACGTGGTCA 400 

AATCAAAGTT GGTGAAGAAG TTGAAATCAT CGGTATGCAC GAAACTTCTA 450 

AAACAACTGT TACTGGTGTA GAAATGTTCC GTAAATTATT AGACTACGCT 500 

GAAGCTGGTG ACAACATCGG TGCTTTATTA CGTGGTGTTG CACGTGAAGA 550 

CGTACAACGT GGTCAAGTAT TAGCTGCTCC TGGTTCTATT ACACCACACA 600 

CAAAATTCAA AGCTGAAGTA TACGTATTAT CTAAAGATGA AGGTGGACGT 650 

CACACTCCAT TCTTCACTAA CTATCGCCCA CAATTCTATT TCCGTACTAC 700 

TGACGTAACT GGTGTTGTAA ACTTACCAGA AGGTACAGAA ATGGTTATGC 750 

CTGGCGACAA CGTTGAAATG 770 



2 ) INFORMATION FOR SEQ ID NO: 878 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 716 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: ATCC 35983 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 878 

TTGTATTCTT AAACAAAGTT GACATGGTAG ACGACGAAGA ATTATTAGAA 50 

TTAGTTGAAA TGGAAGTTCG TGACTTATTA AGCGAATATG ACTTCCCAGG 100 

TGACGATGTA CCTGTAATCG CTGGTTCTGC ATTAAAAGCA TTAGAAGGCG 150 

ATGCTGAATA CGAACAAAAA ATCTTAGACT TAATGCAAGC AGTTGATGAT 2 00 

TACATTCCAA CTCCAGAACG TGATTCTGAC AAACCATTCA TGATGCCAGT 250 

TGAGGACGTA TTCTCAATCA CTGGTCGTGG TACTGTTGCT ACAGGCCGTG 300 

TTGAACGTGG TCAAATCAAA GTTGGTGAAG AAGTTGAAAT CATCGGTATG 350 
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CACGAAACTT CTAAAACAAC 
ATTAGACTAC GCTGAAGCTG 
TTGCACGTGA AGACGTACAA 
ATTACACCAC ACACAAAATT 
TGAAGGTGGA CGTCACACTC 
ATTTCCGTAC TACTGACGTA 
GAAATGGTTA TGCCTGGCGA 
TCCAATCGCT ATCGAA 



TGTTACTGGT 
GTGACAACAT 
CGTGGTCAAG 
CAAAGCTGAA 
CATTCTTCAC 
ACTGGTGTTG 
CAACGTTGAA 



GTAGAAATGT TCCGTAAATT 
CGGTGCTTTA TTACGTGGTG 
TATTAGCTGC TCCTGGTTCT 
GTATACGTAT TATCTAAAGA 
TAACTATCGC CCACAATTCT 
TAAACTTACC AGAAGGTACA 
ATGACAGTTG AATTAATCGC 



400 
450 
500 
550 
600 
650 
700 
716 
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20 



25 



30 



35 



40 



45 
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2) INFORMATION FOR SEQ ID NO: 879 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 640 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R764 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 879 

CGGTCCTATG CCTCAAACTC GTGAACACAT CTTGTTATCA CGTAACGTTG 50 

GCGTACCATA CATCGTTGTT TTCTTGAACA AAATGGATAT GGTTGATGAC 100 

GAAGAATTGC TAGAATTAGT TGAAATGGAA GTTCGTGACC TATTGTCTGA 150 

ATATGACTTC CCAGGCGACG ATGTTCCTGT AATCGCCGGT TCTGCTTTGA 200 

AAGCTCTTGA AGGAGATCCT TCATACGAAG AAAAAATCAT GGAATTGATG 250 

GCTGCAGTTG ACGAATACGT TCCAACTCCA GAACGTGATA CTGACAAACC 3 00^ 

ATTCATGATG CCAGTCGAAG -ACGTATTCTC AATCACTGGA CGTGGTACTG 350 

TTGCTACAGG CCGTGTTGAA CGTGGACAAG TTCGCGTTGG TGATGAAGTA 400 

GAAATCGTTG GTATTGCTGA CGAAACTGCT AAAACAACTG TAACAGGTGT 450 

TGAAATGTTC CGTAAATTGT TAGACTATGC TGAAGCAGGG GATAACATTG 500 

GTGCATTGCT ACGTGGGGTT GCTCGTGAAG ACATCCAACG TGGACAAGTA 550 

TTGGCTAAAG CTGGTACAAT CACACCTCAT ACAAAATTCA AAGCTGAAGT 600 

TTATGTTTTG ACAAAAGAAG AAGGTGGACG TCACACTCCA 640 



2) INFORMATION FOR SEQ ID NO: 880 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 

(B) STRAIN: PAO-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 880 
CGGCGCGATC CTGGTTTGCT CGGCTGCCGA CGGCCCCATG CCGCAGACCC 50 



494 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



GCGAGCACAT CCTGCTGTCC 
TTCCTGAACA' AGGCCGACAT 
CGAGATGGAA GTTCGCGATC 
ACACTCCGAT CATCATCGGT 
GACAACGGCA TCGGCGTAAG 
CTCCTACATT CCGGAGCCGG 
CGATCGAAGA CGTGTTCTCG 
CGTGTAGAGC GCGGCATCAT 
CATCAAGGCG ACCACCAAGA 
AGCTGCTCGA CGAAGGTCGT 
GGCACCAAGC GTGAAGACGT 
CACCATCAAG CCGCACACCA 
AGGAAGAAGG TGGTCGTCAC 
TTCTACTTCC GTACCACCGA 
CGTAGAGATG GTAATGCCGG 
TCGCTCCGAT CGCCATGGAA 



CGCCAGGTAG 
GGTCGACGAC 
TGCTGAACAC 
TCCGCGCTGA 
CGCCGTGCAG 
TTCGTGCCAT 
ATCTCCGGCC 
CAAGGTCCAG 
CTACCTGCAC 
GCTGGTGAGA 
AGAGCGTGGC 
AGTTCGAGTG 
ACCCCGTTCT 
YGTGACCGGT 
GCGACAACAT 
GATGGCCTGC 



GCGTTCCCTA 
GCCGAGCTGC 
CTACGACTTC 
TGGCGCTGGA 
AAGCTGGTAG 
CGACCAGCCG 
GCGGTACCGT 
GAAGAAGTGG 
CGGCGTTGAA 
ACGTTGGTAT 
CAGGTTCTGG 
CGAAGTGTAC 
TCAAGGGCTA 
AACTGCGAAC 
CAAGATGGTT 
G 



CATCGTC&TG 
TGGAACTGGT 
CCGGGCGACG 
AGGCAAGGAT 
AGACCCTGGA 
TTCCTGATGC 
GGTAACCGGT 
AAATCGTCGG 
ATGTTCCGCA 
CCTGCTGCGT 
CCAAGCCGGG 
GTGCTGTCCA 
CCGTCCGCAG 
TGCCGGAAGG 
GTCACCCTGA 



100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
831 



2) INFORMATION FOR SEQ ID NO: 881 

' (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 642 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: R421 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 881 

CCTATGCCTC AAACACGTGA ACACATCTTG TTATCACGTA ACGTTGGTGT 50 

ACCATACATC GTTGTTTTCT TAAACAAAAT GGATATGGTT GATGACGAAG 100 

AATTACTAGA ATTAGTTGAA ATGGAAGTTC GTGACTTATT GTCAGAATAT 150 

GACTTCCCAG GCGACGATGT TCCTGTAATC GCTGGTTCTG CTTTGAAAGC 200 

TCTTGAAGGC GATGCTTCAT ACGAAGAAAA AATCATGGAA TTAATGGCTG 250 

CAGTTGACGA ATACGTTCCA ACTCCAGAAC GTGACACTGA CAAACCATTC 300 

ATGATGCCAG TCGAAGACGT ATTCTCAATC ACTGGACGTG GTACTGTTGC 350 

TACAGGCCGT GTTGAACGTG GACAAGTTCG CGTTGGTGAC GAAGTTGAAA 400 

TCGTTGGTAT TGCTGAAGAA ACTGCTAAAA CAACTGTAAC TGGTGTTGAA 450 

ATGTTCCGTA AATTGTTAGA CTATGCTGAA GCAGGGGATA ACATTGGTGC 500 

ATTGCTACGT GGTGTTGCTC GTGAAGACAT CCAACGTGGA CAAGTATTGG 550 

CTAAAGCTGG TACAATCACA CCTCATACAA AATTTAAAGC TGAAGTTTAC 600 

GTTTTAACAA AAGAAGAAGG TGGACGTCAC ACACCATTCT TC 642 



2) INFORMATION FOR SEQ ID NO: 882 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 636 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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5 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: R775 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 882 



CCTATGCCTC AAACACGTGA ACACATCTTG TTATCACGTA ACGTTGGTGT 50 

ACCATACATC GTTGTTTTCT TAAACAAAAT GGATATGGTT GATGACGAAG 100 

10 AATTACTAGA ATTAGTTGAA ATGGAAGTTC GTGACTTATT GTCAGAATAT 150 

GACTTCCCAG GCGACGATGT TCCTGTAATC GCTGGTTCTG CTTTGAAAGC 200 

TCTTGAAGGC GATGCTTCAT ACGAAGAAAA AATCATGGAA TTAATGGCTG 250 

CAGTTGACGA ATACGTTCCA ACTCCAGAAC GTGACACTGA CAAACCATTC 3 00 

ATGATGCCAG TCGAAGACGT ATTCTCAATC ACTGGACGTG GTACTGTTGC 350 

15 TACAGGCCGT GTTGAACGTG GACAAGTTCG CGTTGGTGAC GAAGTTGAAA 400 

TCGTTGGTAT TGCTGAAGAA ACTGCTAAAA CAACTGTAAC TGGTGTTGAA 450 

ATGTTCCGTA AATTGTTAGA CTATGCTGAA GCAGGGGATA ACATTGGTGC 500 

ATTGCTACGT GGTGTTGCTC GTGAAGACAT CCAACGTGGA CAAGTATTGG 550 

CTAAAGCTGG TACAATCACA CCTCATACAA AATTTAAAGC TGAAGTTTAC 600 

20 GTTTTAACAA AAGAAGAAGG TGGACGTCAT ACACCA 636 



2) INFORMATION FOR SEQ ID NO: 883 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 641 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: R422 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 883 

40 GGTCCTATGC CTCAAACACG TGAACATATC TTATTATCAC GTAACGTTGG 50 

TGTACCATAC ATCGTTGTAT TCTTAAACAA AATGGATATG GTTGATGACG 100 

AAGAATTATT AGAATTAGTA GAAATGGAAG TTCGTGACTT ATTATCAGAA 150 

TACGATTTCC CAGGCGATGA TGTTCCAGTT ATCGCAGGTT CTGCTTTGAA 2 00 

AGCTTTAGAA GGCGACGAGT CTTATGAAGA AAAAATCTTA GAATTAATGG 250 

45 CTGCAGTTGA CGAATATATC CCAACTCCAG AACGTGATAC TGACAAACCA 3 00 

TTCATGATGC CAGTCGAAGA CGTATTCTCA ATCACTGGAC GTGGTACTGT 350 

TGCTACAGGC CGTGTTGAAC GTGGTGAAGT TCGCGTTGGT GACGAAGTTG 400 

AAATCGTTGG TATTAAAGAC GAAACATCTA AAACAACTGT TACAGGTGTT 4 50 

GAAATGTTCC GTAAATTATT AGACTACGCT GAAGCAGGCG ACAACATCGG 500 

50 TGCTTTATTA CGTGGTGTAG CACGTGAAGA TATCGAACGT GGACAAGTAT 550 

TAGCTAAACC AGCTACAATC ACTCCACACA CAAAATTCAA AGCTGAAGTA 600 

TACGTATTAT CAAAAGAAGA AGGCGGACGT CACACTCCAT T 641 



55 

2) INFORMATION FOR SEQ ID NO: 884 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 64 0 bases 
60 (B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: R575 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 884 

TATGCCTCAA ACACGTGAAC ATATCTTATT ATCACGTAAC GTTGGTGTAC 50 

CATACATCGT TGTATTCTTA AACAAAATGG ATATGGTTGA TGACGAAGAA 100 

TTATTAGAAT TAGTAGAAAT GGAAGTTCGT GACTTATTAT CAGAATACGA 150 

15 TTTCCCAGGC GATGATGTTC CAGTTATCGC AGGTTCTGCT TTGAAAGCTT 2 00 

TAGAAGGCGA CGAGTCTTAT GAAGAAAAAA TCTTAGAATT AATGGCTGCA 250 

GTTGACGAAT ATATCCCAAC TCCAGAACGT GATACTGACA AACCATTCAT 3 00 

GATGCCAGTC GAAGACGTAT TCTCAATCAC TGGACGTGGT ACTGTTGCTA 350 

CAGGCCGTGT TGAACGTGGT GAAGTTCGCG TTGGTGACGA AGTTGAAATC 4 00 

20 GTTGGTATTA AAGACGAAAC ATCTAAAACA ACTGTTACAG GTGTTGAAAT 4 50 

GTTCCGTAAA TTATTAGACT ACGCTGAAGC AGGCGACAAC ATCGGTGCTT 500 

TATTACGTGG TGTTGCACGT GAAGATATCG AACGTGGACA AGTATTAGCT 550 

AAACCAGCTA CAATCACTCC ACACACAAAA TTCAAAGCTG AAGTATACGT 600 

ATTATCAAAA GAAGAAGGCG GACGTCACAC TCCATTCTTC 640 

25 



2) INFORMATION FOR SEQ ID NO: 885 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 632 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
40 (B) STRAIN: R492 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 885 

TGCCTCAAAC TCGTGAACAC ATCCTATTGT CTCGTCAAGT TGGTGTTCCT 50 

45 TACATCGTTG TATTCTTGAA CAAAGTAGAC ATGGTTGATG ACGAAGAATT 100 

ACTAGAATTA GTTGAAATGG AAGTTCGTGA CCTATTAACA GAATACGAAT 150 

TCCCTGGTGA CGATGTTCCT GTAGTTGCTG GATCAGCTTT GAAAGCTCTA 2 00 

GAAGGCGACG CTTCATACGA AGAAAAAATT CTTGAATTAA TGGCTGCAGT 2 50 

TGACGAATAC ATCCCAACTC CAGAACGTGA CAACGACAAA CCATTCATGA 3 00 

50 TGCCAGTTGA AGACGTGTTC TCAATTACTG GACGTGGTAC TGTTGCTACA 3 50 

GGTCGTGTTG AACGTGGACA AGTTCGCGTT GGTGACGAAG TTGAAGTTGT 4 00 

TGGTATTGCT GAAGAAACTT CAAAAACAAC AGTTACTGGT GTTGAAATGT 4 50 

TCCGTAAATT GTTAGACTAC GCTGAAGCTG GAGACAACAT TGGTGCTTTA 500 

CTACGTGGTG TTGCACGTGA AGACATCCAA CGTGGACAAG TTTTAGCTAA 550 

55 ACCAGGTACA ATCACACCTC ATACAAAATT CTCTGCAGAA GTATACGTGT 600 

TGACAAAAGA AGAAGGTGGA CGTCATACTC CA 632 



60 2) INFORMATION FOR SEQ ID NO: 886 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 640 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R576 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 886 

CGGCCCAATG CCTCAAACTC GTGAACACAT CCTATTGTCT CGTCAAGTTG 50 

GTGTTCCTTA CATCGTTGTA TTCTTGAACA AAGTAGACAT GGTTGATGAC 100 

GAAGAATTAC TAGAATTAGT TGAAATGGAA GTTCGTGACC TATTAACAGA 150 

ATACGAATTC CCTGGTGACG ATGTTCCTGT AGTTGCTGGA TCAGCTTTGA 200 

AAGCTCTAGA AGGCGACGCT TCATACGAAG AAAAAATTCT TGAATTAATG 250 

GCTGCAGTTG ACGAATACAT CCCAACTCCA GAACGTGACA ACGACAAACC 3 00 

ATTCATGATG CCAGTTGAAG ACGTGTTCTC AATTACTGGA CGTGGTACTG 3 50 

TTGCTACAGG TCGTGTTGAA CGTGGACAAG TTCGCGTTGG TGACGAAGTT 4 00 

GAAGTTGTTG GTATTGCTGA AGAAACTTCA AAAACAACAG TTACTGGTGT 450 

TGAAATGTTC CGTAAATTGT TAGACTACGC TGAAGCTGGA GACAACATTG 500 

GTGCTTTACT ACGTGGTGTT GCACGTGAAG ACATCCAACG TGGACAAGTT 550 

TTAGCTAAAC CAGGTACAAT CACACCTCAT ACAAAATTCT CTGCAGAAGT 600 

ATACGTGTTG ACAAAAGAAG AAGGTGGACG TCATACTCCA 64 0 



2) INFORMATION FOR SEQ ID NO: 887 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zoogloea rami g era 

(B) STRAIN: ATCC 25935 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 887 

ATCCTGGTTT GCTCCGCAGC TGACGGCCCA ATGCCACAGA CCCGCGAGCA 50 

CATCCTGCTG GCCCGCCAAG TTGGCGTTCC ATACATCATC GTGTTCCTGA 100 

ACAAGTGCGA CCTGGTTGAC GACGCAGAAC TGCTGGAACT GGTCGAAATG 150 

GAAGTGCGTG AATTGCTGTC GAAATACGAG TTCCCAGGCG ACGACGTACC 2 00 

AATCATCAAG GGTTCGGCAC GTATGGCGCT GGAAGGCAAA GAAGGCGAGA 250 

TGGGCGTTGA CGCCATCATG CGTCTGGCCG ATGCACTGGA CAGCTACATC 3 00 

CCTACGCCAG AGCGCGCAGT CGATGGCGCC TTCCTGATGC CAGTGGAAGA 3 50 

CGTGTTCTCG ATCTCGGGTC GCGGTACCGT TGTGACCGGT CGTATCGAGC 4 00 

GCGGCGTGAT CAAGGTCGGC GAAGAGATCG AAATCGTCGG CATTATCGAC 450 

ACCGTCAAAA CCACTTGCAC CGGCGTGGAA ATGTTCCGCA AGCTGCTGGA 500 

CCAGGGTCAA GCCGGCGACA ACGTTGGTCT GCTGCTGCGC GGCACCAAGC 550 

GTGAAGACGT ACAGCGTGGT CAGGTTCTGG CCAAGCCAGC GTCGATCAAG 600 

CCGCACAACC ACTTCACCGG CGAGATCTAC GTTCTGTCGA AAGATGAAGG 650 
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CGGCCGTCAC ACCCCGTTCT 
GTACGACTGA CGTGACCGGT 
GTCATGCCAG GCGACAACGT 
CGCGAT 



TCAACAACTA TCGTCCACAG 
TCGATCGAAC TGCCAGCAGA 
GTCGATCACC GTCAAGCTGA 
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TTCTACTTCC " 7 OCT 
CAAAGAAATG 750 
TCAACCCGAT 800 

806 



2) INFORMATION FOR SEQ ID NO: 888 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 634 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecal is 

(B) STRAIN: R503 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 888 

TATGCCTCAA ACACGTGAAC ATATCTTATT ATCACGTAAC GTTGGTGTAC 50 

CATACATCGT TGTATTCTTA AACAAAATGG ATATGGTTGA TGACGAAGAA 100 

TTATTAGAAT TAGTAGAAAT GGAAGTTCGT GACTTATTAT CAGAATACGA 150 

TTTCCCAGGC GATGATGTTC CAGTTATCGC AGGTTCTGCT TTGAAAGCTT 2 00 

TAGAAGGCGA CGAGTCTTAT GAAGAAAAAA TCTTAGAATT AATGGCTGCA 2 50 

GTTGACGAAT ATATCCCAAC TCCAGAACGT GATACTGACA AACCATTCAT 300 

GATGCCAGTC GAAGACGTAT TCTCAATCAC TGGACGTGGT ACTGTTGCTA 350 

CAGGCCGTGT TGAACGTGGT GAAGTTCGCG TTGGTGACGA AGTTGAAATC 400 

GTTGGTATTA AAGACGAAAC ATCTAAAACA ACTGTTACAG GTGTTGAAAT 450 

GTTCCGTAAA TTATTAGACT ACGCTGAAGC AGGCGACAAC ATCGGTGCTT 500 

TATTACGTGG TGTAGCACGT GAAGATATCG AACGTGGACA AGTATTAGCT 550 

AAACCAGCTA CAATCACTCC ACACACAAAA TTCAAAGCTG AAGTATACGT 600 

AT TATC AAAA GAAGAAGGCG GACGTCACAC TCCA 634 



2) INFORMATION FOR SEQ ID NO: 889 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 93 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: ATCC 14110 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 889 

TGTCTTCATC CAGGAATTGA TTGTGAGTCG TTCCACATGC TCACCTAGTT 50 

TTCGCTCGAT CTTTTCACTA ACGCAAACCA TGTAGAACAA CATTGCCAAG 100 

GCCCACGGTG GTTACTCCGT CTTCACTGGT GTTGGTGAGC GTACTCGTGA 150 

GGGTAACGAT CTGTACCACG AAATGCAGGA GACTGGTGTC ATTCAGCTCG 2 00 

AGGGTGAATC CAAGGTCGCA CTGGTGTTCG GACAGATGAA CGAGCCCCCC 250 
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GGTGCCCGTG CCCGTGTCGC CCTTACCGGT 
CCGTGACGAG GAGGGTCAGG ACGTGCTGCT 
GTTTCACCCA GGCCGGTTCT GAGGTGTCTG 
TCTGCCGTCG GTTACCAGCC CACCCTGGCC 
5 GGAGCGTATC ACCACCACCA AGAAGGGTTC 



PCT/CA00/01150 

CTGACCATTG CCGAGTACTT 3 00 

CTTCATTGAC AACATTTTCC 350 

CCCTTCTCGG TCGTATCCCC 400 

GTCGACATGG GTGGTATGCA 450 

TATTACCTCC GTC 493 



2) INFORMATION FOR SEQ ID NO: 890 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 66 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Penicillium marneffei 

(B) STRAIN: ATCC 64101 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 890 

25 TTGATTGTAC GTCTTTACCT TTCTGCCTGA CTGTTTACGA CAACTAACGA 50 

AAGCGTAGAA CAACATTGCC AAGGCTCACG GTGGTTACTC TGTCTTCACT 100 

GGTGTCGGTG AACGTACTCG TGAGGGTAAC GATTTGTACC ACGAAATGCA 150 

GGAAACTGGT GTCATTCAGC TCGAGGGTGA ATCCAAGGTC GCCCTCGTGT 2 00 

TCGGTCAGAT GAACGAGCCC CCCGGTGCCC GTGCCCGTGT CGCTCTTACT 250 

30 GGTTTGACCA TTGCCGAGTA CTTCCGTGAC GAGGAAGGTC AGGACGTGCT 3 00 

TCTCTTCATT GACAACATTT TCCGTTTCAC TCAGGCCGGT TCTGAGGTGT 350 

CTGCCCTTCT GGGTCGTATC CCCTCTGCCG TCGGTTACCA GCCCACCCTT 4 00 

GCCGTCGACA TGGGTATCAT GCAGGAGCGT ATTACCACCA CCACCAAGGG 450 

TTCCATCACC TCCGTC 4 66 

35 



2) INFORMATION FOR SEQ ID NO: 891 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 78 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paecilomyces lilacinus 
50 (B) STRAIN: ATCC 42570 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 891 

AGGAGCTGAT CAACAACATC GCCAAGGCTC ACGGTGGTTA CTCCGTCTTC 50 

55 ACTGGTGTCG GTGAGCGTAC CCGTGAGGGT AACGATCTGT ACCACGAAAT 100 

GCAGGAGACC TCGGTCATTC AGCTCGAGGG CGAGTCTAAG GTGGCCCTGG 150 

TCTTTGGTCA GATGAACGAG CCCCCGGGTG CTCGTGCCCG TGTCGCTCTT 200 

ACTGGTCTTA CCGTCGCCGA GTACTTCCGT GACCAGGAGG GTCAGGATGG 250 

TTAGTTCTCG TCCACTCATG CCGAAACATG TGCGTGTTCC GAGGCTAATC 3 00 

60 AACGTGCCAG TGCTGCTTTT CATCGACAAC ATTTTCCGAT TCACACAGGC 350 

500 
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CGGTTCCGAG GTGTCTGCCC TGCTGGGTCC?' Pttt'f CT* GCCGTtGGTT 
ACCAGCCCAC CCTCGCCGTC GACATGGGTG GCATGCAGGA GCGTATCACC 
ACCACCAAGA AGGGCTCTAT CACCTCCG 



400/ 

450 

478 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 892 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 481 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium marneffei 

(B) STRAIN: ATCC 58950 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 892 

GTCTTTATCC AGGAGTTGAT TGTACGTCTT TACCTTTCTG CCTGACTGTT 50 

TACGACAACT AACGAAAGCG TAGAACAACA TTGCCAAGGC TCACGGTGGT 100 

TACTCTGTCT TCACTGGTGT CGGTGAACGT ACTCGTGAGG GTAACGATTT 150 

GTAC CACGAA ATGCAGGAAA CTGGTGTCAT TCAGCTCGAG GGTGAATCCA 200 

AGGTCGCCCT CGTGTTCGGT CAGATGAACG AGCCCCCCGG TGCCCGTGCC 250 

CGTGTCGCTC TTACTGGTTT GACCATTGCC GAGTACTTCC GTGACGAGGA 3 00 

AGGTCAGGAC GTGCTTCTCT TCATTGACAA CATTTTCCGT TTCACTCAGG 3 50 

CCGGTTCTGA GGTGTCTGCC CTTCTGGGTC GTATCCCCTC TGCCGTCGGT 400 

TACCAGCCCA CCCTTGCCGT CGACATGGGT ATCATGCAGG AGCGTATTAC 450 

CACCACCACC AAGGGTTCCA TCACCTCCGT C 481 



2 ) INFORMATION FOR SEQ ID NO: 8 93 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1208 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporothrix schenckii 

(B) STRAIN: ATCC 14285 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 893 

CTCGTCCGTG GTGCTAAGGC CACTGACACT GGTTSCCCCA TTACCATCCS 50 
CGTCGGCCCC GGTACCCTCG GTCGCATCAT GAACGTCACC GGTGACCCGA . 100 

TCGACGAGCG CGGTCCCATC AAGACCGACA AGTTCCGTCC CATCCACGCT 150 

GAGGCTCCCG AGTTCGTTGA CCAGTCGACC ACCGCTGAGG TTCTCGTGAC 200 

TGGTATCAAG GTCGTCGATC TGCTTGCTCC CTACGCCCGT GGTGGTAAGA 250 

TTGGTCTGTT TGGCGGTGCC GGTGTTGGCA AGACCGTGTT CATCCAGGAG 30 0 

CTCATCAACA ACATCGCCAA GGCCCACGGT GGTTACTCCG TCTTCACCGG 350 

TGTCGGCGAG CGTACCCGTG AGGGTAACGA TCTGTACCAC GAAATGCAGG 4 00 

AGACCTCTGT CATTCAGCTT GACGGTGACT CCAAGGTCGC CCTGGTGTTC 450 
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GGTCAGATGA ACGAGCCCCC TGGTGCTCGT GCCCGTGTCG CCTTGACClGG 500 

TTTGACTGTC GCTGAGTACT TCCGTGACGA GGAGGGCCAG GATGGTATGT 550 

TTTGAATTAT TTCCTTGTCG TACAGTTCCA AATCGAAGAA TTACTAACTT 600 

GTCAGTGCTT CTCTTCATCG ACAACATTTT CCGCTTCACC CAGGCCGGTT 650 

5 CTGAGGTGTC TGCCCTTCTG GGTCGTATTC CCTCCGCTGT CGGTTACCAG 700 

CCCACGCTCG CCGTGGACAT GGGTCTGATG CAGGAGCGTA TTACCACCAC 750 

CCGCAAGGGC TCAATTACCT CCGTCCAGGC CGTCTACGTG CCCGCTGACG 800 

ATCTGACGGA TCCCGCCCCC GCCACCACCT TCGCCCATCT GGACGCCACC 850 

ACTGTGCTGT CCCGAGGTAT CTCTGAGCTG GGTATCTACC CCGCTGTCGA 900 

10 CCCCCTCGAC TCCAAGTCGC GTATGCTGGA CCCCCGTATT GTCGGTGACG 950 

ACCACTACGA GACCGCCACT CGCGTCCAGC AGATCCTCCA AGAGTACAAG 1000 

TCGCTGCAGG ACATCATCGC CATTCTGGGT ATGGACGAGC TGTCTGAGGC 1050 

CGACAAGCTT ACAGTCGAGC GTGCTCGTAA GATCCAGCGT TTCCTGAGCC 1100 

AGCCGTTCAC GGTCGCGCAG GTCTTCACTG GTATCGAAGG CCAGCTGGTC 1150 

15 GATCTGAAGG ACACTATCGC TTCGTTCAAG GCTATCCTGA GCGGTGAGGG 1200 

TGACAGCC 1208 



20 2) INFORMATION FOR SEQ ID NO: 894 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malbranchea filamentosa 

(B) STRAIN: ATCC 48174 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 94 



TTCAGGAACT TATTGTAAGC CGCCCTCTTT ATGCATTGAG GGTGAATAAG 50 

AAGGCTGACA GGTAATAGAA CAACATTGCC AAGGCTCACG GTGGTTACTC 100 

CGTGTTCACT GGTGTCGGTG AGCGTACCCG TGAAGGAAAC GATCTGTACC 150 

ATGAAATGCA GGAAACCCGC GTCATCCAGC TTGATGGCGA GTCTAAGGTC 200 

40 GCCCTTGTGT TCGGTCAGAT GAACGAGCCC CCTGGAGCCC GTGCCCGTGT 250 

CGCTCTTACT GGTCTTACCG TTGCCGAATA CTTCCGTGAC GAGGAGGGCC 300 

AAGATGGTAC GCCTTTTTAC TCTTCTTATT CTTCGGGTCG GACTACAGAA 350 

CTAACCTGCT CCAGTGCTTC TCTTCATTGA TAACATTTTC CGTTTCACAC 400 

AAGCCGGTTC TGAGGTGTCT GCCTTGCTTG GACGTATTCC CTCTGCCGTC 450 

4 5 GGTTACCAGC CCACTCTCGC CGTCGACATG GGTGGTATGC AGGAACGTAT 500 

CACAACCACC AACAAGGGTT CCATTACTTC CGTG 534 



50 2) INFORMATION FOR SEQ ID NO: 895 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 448 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE; 
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(A) ORGANISM: Paecilomyces lilacinus 

(B) STRAIN: ATCC 60735 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 895 

CAAGGCTCAC GGTGGTTACT CCGTCTTCAC TGGTGTCGGT GAGCGTACCC 50 

GTGAGGGTAA CGATCTGTAC CACGAAATGC AGGAGACCTC GGTCATTCAG 100 

CTCGAGGGCG AGTCTAAGGT GGCCCTGGTC TTTGGTCAGA TGAACGAGCC 150 

CCCGGGTGCT CGTGCCCGTG TCGCTCTTAC TGGTCTTACC GTCGCCGAGT 200 

ACTTCCGTGA CCAGGAGGGT CAGGATGGTT AGTTCTCGTC CACTCATGCC 250 

GAAACATGTG CGTGTTCCGA GGCTAATCAA CGTGCCAGTG CTGCTTTTCA 3 00 

TCGACAACAT TTTCCGATTC ACCCAGGCCG GTTCCGAGGT GTCTGCCCTG 350 

CTGGGTCGTA TCCCCTCTGC CGTCGGTTAC CAGCCCACCC TCGCCGTCGA 400 

CATGGGTGGC ATGCAGGAGC GTATCAC C AC CACCAAGAAG GGCTCTAT 448 



2) INFORMATION FOR SEQ ID NO: 8 96 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 483 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus niger 

(B) STRAIN: ATCC 9508 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 896 



ATTCAGGAGC TGATTGTAAG TTGCCAATCC 
GACCCATAGA ACTAACAAAT TATTTAGAAC 
TGGTTACTCC GTCTTCTGTG GTGTCGGTGA 
ATCTGTACCA CGAAATGCAG GAGACTGGTG 
TCCAAGGTCG CTCTGGTCTT CGGTCAGATG 
TGCCCGTGTC GCCCTTACCG GTCTGACCAT 
AGGAGGGTCA GGACGTGCTG CTCTTCATTG 
CAGGCCGGTT CTGAGGTGTC TGCCCTTCTG 
CGGTTACCAG CCCACTCTGG CCGTCGACAT 
TTACCACCAC CACCAAGGGT TCCATTACCT 



ATGAACTGGA GATTTGGTGT 50 

AACATCGCCA AGGCTCACGG 100 

GCGTACTCGT GAGGGTAACG 150 

TCATCCAGCT CGAGGGTGAC 2 00 

AACGAGCCCC CGGGTGCCCG 250 

TGCCGAGTAC TTCCGTGACG 300 

ACAACATTTT CCGTTTCACC 3 50 

GGTCGTATCC CCTCTGCCGT 4 00 

GGGTGGTATG CAGGAGCGTA 450 

CCG 483 



2) INFORMATION FOR SEQ ID NO: 897 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1124 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fuxnigatus 

(B) STRAIN: ATCC 14110 

503 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CAO0/01 150 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 897 

AAGCTCAAGT CCGAGCGTGA GCGTGGTATC ACCATCGACA TTGCCCTCTG 50 

GAAGTTCCAG ACTCCCAAGT ATGAGGTCAC TGTCATCGGT AAGCTCGACT 100 

5 CGCCCCGATA TGTTTTGGTG CTGTAGCTAA CACGATCTGA AGATGCCCCC 150 

GGTCACCGTG ACTTCATCAA GAACATGATC ACTGGTACCT CCCAGGCTGA 200 

CTGCGCTATC CTCATCATTG CCTCCGGTAC TGGTGAGTTC GAGGCTGGTA 250 

TCTCCAAGGA TGGCCAGACC CGTGAGCACG CTCTGCTGGC TTTCACCCTC 3 00 

GGTGTCAAGC AGCTCATCGT CGCCCTCAAC AAGATGGACA CCTGCAAGTG 3 50 

10 GTCCGAGGAT CGTTACAACG AAATTGTCAA GGAAACCTCC AACTTCATCA 4 00 

AGAAGGTCGG CTACAACCCC AAGGCCGTTC CCTTCGTCCC CATCTCTGGC 450 

TTCAACGGTG ACAACATGCT TGAGCCCTCC TCCAACTGCC CCTGGTACAA 500 

GGGATGGGAG AAGGAGACCA AGGCCGGCAA GGTCACTGGT AAGACCCTCA 550 

TCGAGGCCAT CGACGCCATT GAGCCCCCTG TCCGTCCCTY CAACAAGCCC 600 

15 CTCCGTCTTC CCCTCCAGGA TGTCTACAAG ATCTCTGGTA TCGGAACGGT 650 

CCCTGTCGGC CGTGTCGAGA CCGGTATCAT CAAGCCCGGC ATGGTCGTCA 700 

CCTTCGCCCC CGCCAACGTC AC C ACTGAAG TCAAGTCCGT CGAGATGCAC 750 

CACCAGCAGC TCCAGGAGGG TGTCCCCGGT GACAACGTCG GTTTCAACGT 800 

CAAGAACGTT TCCGTCAAGG AAGTCCGCCG TGGTAACGTC TGCGGTGACT 850 

20 CCAAGAACGA TCCCCCTCAG GGTGCTGCCT CCTTCAACGC CCAGGTCATC 900 

GTCCTCAACC ACCCCGGTCA GGTCGGCGCT GGTTACGCCC CCGTCCTCGA 950 

CTGCCACACT GCCCACATTG CTTGCAAGTT CTCTGAGCTG CTTGAGAAGA 1000 

TTGACCGCCG TACCGGCAAG TCTGTTGAGA ACAACCCCAA GTTCATCAAG 1050 

TCCGGTGATG CCGCCATCGT GAAGATGGTT CCTTCCAAGC CCATGTGTGT 1100 

25 CGAGTCCTTC ACTGACTACC CCCC 1124 



30 



2) INFORMATION FOR SEQ ID NO: 898 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1363 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Penicillium marneffei 

(D) STRAIN: WSA-214 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 898 

45 CTCAAGGCTG AGCGTGAGCG TGGTATCACC ATCGATATTG CTCTCTGGAA 50 

GTTCCAGACT GCCAAGTACG AGGTTACCGT CATTGACGCC CCCGGTCACC 100 

GTGATTTCAT CAAGAACATG ATCACTGGTA CCTCCCAGGC CGATTGCGCT 150 

ATTCTCATCA TTGCCTCTGG TACTGGTGAA TTCGAGGCTG GTATCTCCAA 2 00 

GGATGGCCAG ACTCGTGAGC ACGCTCTTTT GGCTTTCACC CTCGGTGTCC 250 

50 GTCAGCTCAT TGTTGCCCTC AACAAGATGG ACACTTGCAA GTGGTCTCAG 3 00 

GGTGAGTACT CGTACCTGCG TTTGGCCTTG AATATCTTAC TAATGCACCA 350 

TAGATCGTTA CAACGAAATT GTCAAGGAGA CTTCCAACTT CATCAAGAAG 400 

GTCGGATACA ACCCTAAGAA CGTTCCTTTC GTTCCTATCT CCGGTTTCAA 450 

CGGTGACAAC ATGCTTGAGC CCTCCCCCAA CTGCCCCTGG TACAAGGGTT 500 

55 GGGAGAAGGA GACCAAGGCC GGTAAGGTCA CTGGTAAGAC CCTCCTCGAG 550 

GCCATCGACG CCATTGAGCC CCCTACCCGT CCCGCCAACA AGGTTAGTCC 600 

CTCCTCGACT ACTCAAACCC TCCTCATAAG TTCATGATTA CGACTCGTTC 650 

ACAGCCCCTC CGTCTTCCCC TCCAGGATGT CTACAAGATC GGTGGTATTG 700 

GAACGGTTCC CGTCGGTCGT GTTGAGACTG GTACCATCGT TCCTGGTATG 750 

60 GTTGTCACCT TGTAAGTCAC TCTCCTCGCT TATCCTACCT GAAATCATCA 8 00 

504 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

TGTGCTAACT TGACACTCAG CGCTCCCGCC 
GAGTGTTGAA ATGCACCACC AGCAGCTCAC 
ACGTTGGTTT CAACGTGAAG AACGTCTCCG 
AACGTTGCTG GTGACAGCAA GAACGACCCC 
5 CAACGCCCAG GTCATCGTCC TCAACCACCC 
ACGCCCCAGT CCTCGATTGC CACACTGCCC 
GAGCTCCTCG AGAAGATTGA CCGTCGTACC 
CCCCAAGTTC ATCAAGTCCG GTGACGCTGC 
CCAAGCCCAT GTGTGTTGAG GCTTTCACCG 
10 TTCGCCGTTC GCGAGTAAGT TTTATCTCCG 
CCTTCTCCTC CGTCTTCCAT ATATATTTTT 
ACAAATCACG GGA 



PCT/CA00/01150 

AACGTCACCA CTGAAGTbAA 850 

TGCCGGTCAG CCCGGTGACA 900 

TCAAGGAAAT CCGTCGTGGT 950 

CCTGCCGGTG CTGCCTCCTT 1000 

CGGTCAGGTC GGTGCTGGTT 1050 

ACATTGCTTG CAAGTTCGCT 1100 

GGAAAGTCTG TTGAGGACCA 1150 

CATCGTCAAG ATGATTCCTT • 1200 

AGTACCCTCC TCTCGGTCGT 1250 

TTGTCTATTT TCCATCCTTC 1300 

TCAGTTATAT GTGACTAACC 1350 

1363 



15 



2) INFORMATION FOR SEQ ID NO: 899 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1147 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Piedraia hortai 

(B) STRAIN: ATCC 24292 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 899 

CAAGCTGAAA GCCGAGCGTG AGCGTGGTAT CACTATCGAC ATTGCCCTCT 50 

GGAAGTTCGA GACTCCCAAG TACTATGTCA CCGTCATTGG TACGTCGCAT 100 

TATCTCACTC CTCACAGAAG CACGCTCCTA ACATCACACA GACGCTCCCG 150 

35 GTCACCGTGA TTTCATCAAG AACATGATCA CTGGTACCTC CCAGGCCGAC 2 00 

TGCGCTATCC TCATTATCGC TGCCGGTACT GGT&AGTTCG AGGCTGGTAT 250 

CTCCAAGGAT GGCCAGACTC GTGAGCACGC CCTGCTCGCC TACACCCTCG 3 00 

GTGTGCGTCA GCTCATCGTC GCCATCAACA AGATGGACAC CACCAAGTGG 350 

TCTGAGGCCC GTTACCAGGA GATCATCAAG GAGACCTCCA ACTTCATCAA 4 00 

40 GAAGGTCGGC TACAACCCCA AGACCGTCGC TTTCGTCCCC ATCTCTGGCT 450 

TCAACGGCGA CAACATGCTT GCCCCCTCCA CCAACTGCCC CTGGTACAAG 500 

GGATGGGAGC GTGAGGTCAA GGGCAACAAG CAGACCGGCA AGACCCTCCT 550 

CGAGGCCATT GACGGCATTG AGCCCCCCAA GCGTCCTTCC GACAAGCCCC 600 

TCCGTCTTCC TCTCCAGGAT GTTTACAAGA TCGGTGGTAT CGGAACTGTT 650 

45 CCTGTCGGCC GTATCGAGAC TGGTGTCCTC AAGCCCGGTA TGGTCGTTAC 700 

CTTCGCTCCC GCCAACGTCA CCACTGAAGT CAAGTCCGTC GAGATGCACC 750 

ACGAGCAGCT CACTGAGGGT CTTCCCGGTG ACAACGTCGG TTTCAACGTG 800 

AAGAACGTTT CCGTCAAGGA CATCCGCCGT GGTAACGTTG CCAGTGACTC 850 

CAAGAACGAC CCCGCTCTGG GTGCCGCTTC TTTCGACGCC CAGGTCATCG 900 

50 TCCTCAACCA CCCCGGTCAG GTCGGTGCTG GTTACGCCCC GGTCCTCGAT 950 

TGCCACACTG CCCACATTGC TTGCAAGTTC TCCGAGATCA AGGAGAAGAT 1000 

CGACCGCCGT ACCGGCAAGT CTGTTGAGGA CGCCCCCAAG TTCATCAAGT 1050 

CTGGTGACTC TGCCATCGTC AAGATGGTTC CCTCCAAGCC CATGTGCGTT 1100 

GAGGCTTTCA CCGACTACCC TCCTCTGGGC CGTTTCGCCG TCCGTGA 1147 

55 



2) INFORMATION FOR SEQ ID NO: 900 

60 (i) SEQUENCE CHARACTERISTICS: 

505 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

(A) LENGTH: 1150 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paecilomyces lilacinus 
10 (B) STRAIN: ATCC 60735 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 900 

CTCAAGGCCG AGCGTGAGCG TGGTATCACC ATCGACATTG CCCTCTGGAA 50 

15 GTTCGAGACT CCCAAGTACT ATGTCACCGT CATTGGTACG TCGACTCGCG 100 

CGAGACTGGT CGCAATTTCC ACGTCGCTAA CGTGCTTGAA CAGACGCTCC 150 

CGGCCACCGT GACTTCATCA AGAACATGAT CACTGGTACC TCCCAGGCTG 200 

ACTGCGCTAT CCTCATTATC GCTGCCGGCA CTGGTGAGTT CGAGGCTGGT 250 

ATCTCCAAGG ATGGCCAGAC CCGTGAGCAC GCTCTGCTCG CCTACACCCT 3 00 

2 0 CGGTGTTAAG CAGCTCATCG TCGCTATCAA CAAGATGGAC ACCACCAAGT 350 

GGTCTGAGGC CCGTTTCCAG GAGATCATCA AGGAGACCTC CAACTTCATC 4 00 

AAGAAGGTCG GCTACAACCC CAAGACCGTC GCTTTCGTCC CCATCTCTGG 450 

TTTCCACGGC GACAACATGC TTTCCCCCTC CACCAACTGC CCCTGGTACA 500 

AGGGCTGGGA GAAGGAGACC AAGGCTGGCA AGTCCACCGG CAAGACCCTC 550 

25 CTTGAGGCCA TCGACTCCAT CGAGCCCCCC AAGCGCCCCA GCGACAAGCC 600 

CCTCCGCCTT CCCCTTCAGG ATGTGTACAA GATCGGCGGT ATCGGCACAG 650 

TCCCTGTCGG CCGTATCGAG ACTGGTGTCA TCAAGCCCGG CATGGTCGTG 700 

ACCTTCGCTC CTTCCAACGT CACCACCGAA GTCAAGTCCG TTGAGATGCA 750 

CCACGAGCAG CTCTCCGAGG GTGTCCCCGG TGACAACGTC GGCTTCAACG 800 

3 0 TCAAGAACGT CTCCGTCAAG GAGATCCGTC GTGGCAACGT CGCCGGTGAC 850 

TCCAAGAACG ACCCCCCTCT GGGTGCCGCT TCTTTCGATG CCCAGGTCAT 900 

CGTCCTCAAC CACCCCGGCC AGGTCGGTGC TGGCTACGCC CCCGTCCTCG 950 

ACTGCCACAC CGCCCACATT GCCTGCAAGT TCGCCGAGAT CAAGGAGAAG 1000 

ATCGACCGCC GTACCGGCAA GTCTGTCGAG TCCGCCCCCA AGTTCATCAA 1050 

3 5 GTCTGGCGAC TCTGCCATCG TCAAGATGAT TCCCTCCAAG CCCATGTGCG 1100 

TTGAGGCTTT CACCGACTAC CCTCCTCTGG GCCGCTTCGC CGTCCGTGAC 1150 



40 2) INFORMATION FOR SEQ ID NO: 901 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 

(B) STRAIN: ATCC 32075 



55. 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 901 



TACCACTAAG TGGTCCGAGA CCCGATTCAA TGAAATTATC AAGGAAGTCA 50 

CCAATTTCAT TAAGAAGGTC GGATATAACC CCAAGACTGT TCCTTTCGTT 100 

CCCATTTCTG GTTTCGAGGG TGACAACATG ATCGAGCCCT CTGCCAACTG 150 

CCCATGGTAC AAGGGCTGGT CCAAGGAGAC TGCTCAGGGC AAGTACTCTG 200 

60 GCAAGACCCT TCTTGAGGCC ATCGACGCCA TTGAGCCCCC CACCCGTCCT 250 

506 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



ACCGATAAAC CTCTCCGTCT 
TATTGGCACT GTTCCTGTCG 
GTATGGTCGT GACCTTCGCT 
GTCGAAATGC ACCACCAGCA 
CGGCTTCAAC GTCAAGAATG 
TTGCCGGTGA CTCTAAGAAT 
GCCCAGGTCA TCGTCCTCAA 
CCCAGTCCTC GACTGCCATA 
TCCTTGAGAA GATTGATCGA 
AAGTTCATCA AGTCCGGTGA 
G 



TCCCCTCCAG 
GACGTGTTGA 
CCCGCCAACG 
GCTTACCGCC 
TTTCCGTCAA 
GATCCCCCAA 
CCACCCTGGT 
CTGCCCACAT 
CGAACCGGAA 
TGCTGCTATC 



GATGTCTACA AGATCTCC^GG 
GACTGGAGTC ATCAAGCCCG 
TCACCACTGA AGTCAAGTCC 
GGTAACCCCG GTGACAACGT 
AGAAGTCCGC CGTGGTAACG 
AGGGCTGCGA TTCCTTCAAT 
CAGGTTGGCG CTGGTTATGC 
TGCCTGCAAA TTCGCTGAGC 
AGTCTGTTGA GAACAACCCC 
GTCAAGATGA TTCCTTCCAA 



300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
751 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 902 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1056 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporothrix schenckii 

(B) STRAIN: ATCC 7968 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 902 

CGTGAGCGCG GTATCACCAT CGATATTGCT CTGTGGAAGT TCGAGACCCC 50 

CAAGTACTAC GTCACCGTCA TTGACGCCCC CGGTCATCGC GATTTCATCA 100 

AGAACATGAT CACTGGTACC TCGCAGGCCG ACTGCGCCAT TCTCATCATT 150 

GCCGCTGGTA CTGGTGAGTT CGAGGCTGGT ATCTCCAAGG ATGGCCAGAC 2 00 

TCGTGAGCAC GCTCTGCTCG CCTACACCCT GGGTGTGCGG CAGCTGATCG 2 50 

TCGCCATCAA CAAGATGGAC ACGGCCAAGT GGGCTGAGGC TCGTTACCAG 300 

GAGATCATCA AGGAGACCTC CAACTTCATC AAGAAGGTCG GCTACAACCC 350 

CAAGACTGTT GCCTTCGTCC CCATCTCGGG CTTCCACGGC GACAACATGC 400 

TTACTCCCTC GACCAACTGC CCCTGGTACA AGGGCTGGGA GAAGGAGGGC 450 

AAGAGCGGCA AGGTTACCGG TAAGACTCTG CTGGACGCCA TTGACGCCGT 500 

CGAGCCCCCC AAGCGCCCCA CGGACAAGCC CCTGCGTCTG CCCCTCCAGG 550 

ATGTCTACAA GATCGGCGGT ATCGGCACTG TCCCTGTCGG CCGTATCGAG 600 

ACTGGTGTCC TGAAGCCCGG CATGGTCGTC ACCTTTGCCC CGTCCAACGT 650 

CACCACTGAA GTCAAGTCCG TCGAGATGCA CCACGAGCAG CTTGTTGAGG 700 

GTGTTCCCGG CGACAACGTC GGCTTCAACG TCAAGAACGT CTCCGTCAAG 750 

GAGATCCGTC GTGGCAACGT TGCCGGTGAC TCCAAGAACG ACCCCCCCTC 800 

GGGCGCCGCC ACCTTCAACG CCCAGGTCAT TGTCCTGAAC CACCCCGGCC 850 

AGGTCGGCAA CGGCTACGCC CCGGTTCTGG ACTGCCACAC CGCCCACATT 900 

GCCTGCAAGT TCACCGAGAT CCTTGAGAAG ATCGACCGCC GTACCGGCAA 950 

GTCGGTTGAG AACAACCCCA AGTTCATCAA GTCGGGTGAC GCCGCCATTG 1000 

TCAAGCTGAC GCCCTCGAAG CCCATGTGCG TTGAGGCCTT CACTGACTAC 1050 

CCCCCT 1056 



2) INFORMATION FOR SEQ ID NO: 903 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1366 bases 

(B) TYPE: Nucleic acid 

507 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicilliiun marneffei 

(B) STRAIN: ATCC 58950 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 903 

CAAGGCTGAG CGTGAGCGTG GTATCACCAT CGATATTGCT CTCTGGAAGT 50 

TCCAGACTGC CAAGTACGAG GTTACCGTCA TTGACGCCCC CGGTCACCGT 100 

GATTTCATCA AGAACATGAT CACTGGTACC TCCCAGGCCG ATTGCGCTAT 150 

15 TCTCATCATT GCCTCTGGTA CTGGTGAATT CGAGGCTGGT ATCTCCAAGG 200 

ATGGCCAGAC TCGTGAGCAC GCTCTTTTGG CTTTCACCCT CGGTGTCCGT 250 

CAGCTCATTG TTGCCCTCAA CAAGATGGAC ACTTGCAAGT GGTCTCAGGG 300 

TGAGTACTCG TACCTGCGTT TGGCCTTGAA TATCTTACTA ATGCACCATA 350 

GATCGTTACA ACGAAATTGT CAAGGAGACT TCCAACTTCA TCAAGAAGGT 400 

20 CGGATACAAC CCTAAGAACG TTCCTTTCGT TCCTATCTCC GGTTTCAACG 450 

GTGACAACAT GCTTGAGCCC TCCCCCAACT GCCCCTGGTA CAAGGGTTGG 500 

GAGAAGGAGA CCAAGGCCGG TAAGGTCACT GGTAAGACCC TCCTCGAGGC 550 

CATCGACGCC ATTGAGCCCC CTACCCGTCC CGCCAACAAG GTTAGTCCCT 600 

CCTCGACTAC TCAAACCCTC CTCATAAGTT CAGATTACTG ACTCGTTCAC 650 

25 AGCCCCTCCG TCTTCCCCTC CAGGATGTCT ACAAGATCGG TGGTATTGGA 700 

ACGGTTCCCG TCGGTCGTGT TGAGACTGGT ACCATCGTTC CTGGTATGGT 750 

TGTCACCTTG TAAGTCACTC TCCTCGCTTA TCCTACCTGA AATCATCATG 800 

TGCTAACTTG ACACTCAGCG CTCCCGCCAA CGTCACCACT GAAGTCAAGA 850 

GTGTTGAAAT GCACCACCAG CAGCTCACTG CCGGTCAGCC CGGTGACAAC 900 

30 GTTGGTTTCA ACGTGAAGAA CGTCTCCGTC AAGGAAATCC GTCGTGGTAA 950 

CGTTGCTGGT GACAGCAAGA ACGACCCCCC TGCCGGTGCT GCCTCCTTCA 1000 

ACGCCCAGGT CATCGTCCTC AACCACCCCG GTCAGGTCGG TGCTGGTTAC 1050 

GCCCCAGTCC TCGATTGCCA CACTGCCCAC ATTGCTTGCA AGTTCGCTGA 1100 

GCTCCTCGAG AAGATTGACC GTCGTACCGG AAAGTCTGTT GAGGACCACC 1150 

35 CCAAGTTCAT CAAGTCCGGT GACGCTGCCA TCGTCAAGAT GATTCCTTCC 1200 

AAGCCCATGT GTGTTGAGGC TTTCACCGAG TACCCTCCTC TCGGTCGTTT 1250 

CGCCGTTCGC GAGTAAGTTT TATCTCCGTT GTCTATTTTC CATCCTTCCC 1300 

TTCTCCTCCG TCTTCCATAT ATATTTTTTC AGTTATATGT GACTAACCAC 1350 

AAATCACGGG AATAGC 13 66 

40 



2) INFORMATION FOR SEQ ID NO: 904 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Curvularia lunata 
55 (B) STRAIN: ATCC 26425 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 904 

CTATTATCGT TGTTGCCGCT TCCGACGGTC AAATGCCCCA GACTCGTGAG 50 
60 CATCTGCTGC TCGCCCGCCA GGTCGGTGTC CAGAAGATCG TTGTCTTCGT 100 

508 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

CAACAAGGTC GATGCTGTTG AGGACAAGGA GATGTTGGAG CTCGTCGAGA 150 

TGGAGATGCG CGAATTGCTC AGCAGCTACG GCTTCGAGGG CGACGAGACT 200 

CCCATCATCA TGGGATCTGC CCTCTGCGCC ATTGAGGGCC GCGAACCTGA 250 

GATTGGTGTC AACCGAATTG ATGAGCTGCT CGAGGCCGTT GATACTTGGA 300 

5 TCCCCACCCC TCAGCGTGAG ACCGACAAGC CTTTCCTCAT GGCCGTCGAG 350 

GATGTCTTCT CCATTGCTGG TCGTGGCACT GTCGTCTCTG GCCGTGTCGA 400 

GCGAGGTATC CTGAAGCGCG ATGCTGAAGT TGAGCTCGTC GGCAAGGGCA 450 

CCGCCCCCAT CAAGACCAAG GTTACCGATA TCGAGACCTT CAAGAAGTCC 500 

TGCGAGGAGT CTCGCGCTGG TGACAACTCC GGTCTCCTTC TTCGTGGTGT 550 

10 CAAGCGTGAT GAAGTCCGCC GTGGTATGGT CGTTTCCGTC CCTGGACAGG 600 

TCAAGGCGCA CAAGAAGTTC CTTGTCTCCA TGTACGTGTT GAGCAAGGAG 650 

GAAGGTGGTC GTCACACTGG CTTCGGTGAG AACTACAGGC CGCAAATGTT 700 

CATCCGCACT GCCGACGAGT CGTGTGCCCT GTACTGGCCA GAAGGCACCG 750 

AGGACGCCCA TGACAAGCTT GTTATGCCCG GTGACAACGT CGAGATGGTT 800 

15 TGCGAGCTCC ATGCACCACA CGTCTTGGAG CCTGGTCAAC G 841 



2) INFORMATION FOR SEQ ID NO: 905 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 967 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
3 0 (A) ORGANISM: Aspergillus niger 

(B) STRAIN: ATCC 9508 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 905 

35 CGGTGCTATC ATTGTCGTCG CCGCCTCCGA CGGTCAGATG TACGTTAACC 50 

TTAAAAGAAT AACTCTCCTT CAGTATATAT GCTTACACTG GCGATCAACA 100 

GGCCCCAGAC TCGTGAGCAC TTGCTGCTTG CTCGTCAGGT CGGTGTCCAG 150 

AAGATCGTTG TCTTCGTCAA CAAGGTCGAT GCTATCGATG ACCCCGAGAT 2 00 

GCTGGAGCTC GTTGAGCTGG AAATGCGCGA GCTTCTCAGC ACCTACGGAT 250 

40 TCGAGGGTGA GGAGACCCCC ATCGTCTTCG GCTCTGCTCT CTGCGCCATT 300 

GAGGACCGCC GCCCCGACAT CGGTACCGAG CGTATCGACG CTCTCCTCGA 350 

GGCCGTTGAC ACCTGGATCC CCACTCCCCA GCGTGACCTT GACAAGCCTT 400 

TCTTGATGTC CATTGAGGAA GTTTTCTCCA TCCCCGGTCG TGGTACCGTC 450 

GCCTCCGGCC GTGTCGAGCG TGGTCTCCTG AAGCGTGATA GCGAGGTTGA 500 

45 GATCATCGGT ACCACCAACG AGGTCATCAA GACCAAGGTT ACCGACATTG 550 

AGACCTTCAA GAAGTCCTGC TCCGAGTCCC GCGCCGGTGA CAACTCCGGT 600 

CTCCTGCTCC GTGGTGTCCG CCGTGAGGAT CTCCGCCGTG GTATGGTCAT 650 

TGCCGCTCCT GGCAGCGCCA AGGC CAAC AG CAAGTTCATG GTCTCCATGT 700 

ACGTCCTGAC CGAGGCTGAR GGTGGTCGCC GTACCGGTTT CGGTGTCCAG 750 

50 TACCGTCCCC AGCTGTTCAT CCGCACTGCC GGTAAGTAAA ATTGCATTCT 800 

ATTCCGCTAC TAGGGAACCA TCTCTAATTC TATTTGCTAC AGATGAGGCT 850 

GCTGAGTTCA GCTTCCCCGA CGGAGACCAG TCCCGCCGTA TCATGCCCGG 900 

TGACAACGTC GAGATGATCG TCAAGACCCA CCGCCCCGTC GCCGCCGAGG 950 

CCGGTCAGCG CTTCAAC 967 

55 



2) INFORMATION FOR SEQ ID NO: 906 

60 (i) SEQUENCE CHARACTERISTICS: 

509 
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WO 01/23604 PCT/CA00/01 150 

(A) LENGTH: 852 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bipolaris hawaiiensis 
10 (B) STRAIN: ATCC 26067 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 906 

TGGTGCTATT ATTGTTGTTG CCGCTTCCGA CGGTCAAATG CCCCAGACTC 50 

15 GTGAGCATCT GCTGCTCGCC CGTCAGGTCG GTGTTCAGAA GATCGTTGTC 100 

TTCGTTAACA AGGTCGACGC TGTCGAGGAC AAGGAGATGT TGGAGCTTGT 150 

CGAGATGGAG ATGCGCGAAC TGCTCAGCAG CTATGGCTTC GAGGGCGACG 200 

AGACCCCTAT CATCATGGGT TCTGCTCTCT GCGCCATTGA AGGCCGTCAA 250 

CCCGACATTG GTGTCGAACG AATTGACGAG CTGCTCGAGG CTGTTGATAC 3 00 

20 TTGGATTCCC ACCCCTCAGC GTGAGACCGA AAAGCCTTTC CTCATGGCCG 350 

TCGAGGATGT CTTCTCCATT GCTGGTCGTG GTACCGTCGT CTCTGGCCGT 400 
GTCGAACGAG GTATCCTGAA GCGCGATGCT GAAGTTGAGC TTGTGGGCAA * 450 

GGGCAGCGCA CCCATCAAGA CCAAGGTTAC CGATATCGAG ACCTTCAAGA 500 

AGTCTTGCGA GGAGTCCCGC GCTGGTGACA ACTCCGGTCT CCTTCTTCGT 550 

25 GGTGTTAAGC GTGATGAAGT CCGCCGTGGT ATGGTCGTTT CCGTCCCTGG 600 

ACAGGTTAAG GCGCACAAGA AGTTCCTTGT CTCCATGTAT GTGCTGAGCA 650 

AGGAGGAAGG TGGCCGACAC ACTGGCTTCG GTGAGAACTA CAGGCCGCAA 700 

ATGTTCATCC GCACTGCCGA CGAGTCGTGT GCCCTGTACT GGCCAGAAGG 750 

CACCGAGGAT GCCCACGACA AGCTTGTCAT GCCCGGTGAC AACGTCGAGA 800 

30 TGGTTTGCGA GCTCCATGCA CCACACGTCT TGGAGACTGG TCAGCGCTTC 850 

AA 852 



35 2 ) INFORMATION FOR SEQ ID NO: 907 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 966 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus flavus 

(B) STRAIN: ATCC 26947 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 907 



GGTGCTATTG TTGTCGTTGC TGCTTCGGAT GGTCATGATG TATGGACAGG 50 

CCCTTTGCTA CTGAATGGTT TCAAGATCTC GCGCTTACAC GTATTATAAT 100 

AGGCCCCAGA CCCGGGAGCA CTTGCTGCTT GCCCGTCAGG TCGGTGTCCA 150 

GAAGATCGTC GTTTTTGTCA ACAAGATTGA TGCCGTTGAG GACCCTGAGA 2 00 

55 TGTTGGAGCT TGTCGAGTTG GAAATGCGCG AGCTCCTTAG CAGCTACGGC 250 

TTCGAGGGCG AAGAGACTCC CATCATCTTC GGTTCTGCTC TGTGTGCTTT 3 00 

GGAGGACCGT CGCCCCGACA TTGGTGCCGA GCGTATCGAC GAGCTCATGA 3 50 

AGGCCGTTGA CACCTGGATC CCTACCCCTC AGCGTGATCT TGACAAGCCT 4 00 

TTCCTCATGT CTGTCGAGGA AGTCTTCTCC ATCGCCGGTC GTGGTACCGT 450 

60 TGCCTCCGGC CGTGTCGAAC GTGGTATCCT GAAGAAGGAC AGCGAAGTCG 500 

510 
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AGATCATCGG AGGTAGCTTC GATGCTACCA 
GAGACCTTCA AGAAGTCTTG TGACGAGTCC 
CTTGCTTCTC CGTGGTATCC GTCGTGAAGA 
TTGCTGCTCC TGGCAGCACC AAGGCCCACG 
5 TACGTTCTCA CTGAGGCTGA GGGTGGTCGT 
CTACCGCCCC CAGGTGTTCG TTCGCACTGC 
TCACTTAACG GTATTGATTA AGTTCTAACT 
GCTGACCTCA GCTTCCCCGA CGGTGATGAG 
TGACAACGTC GAGATGGTCC TCAAGACTCA 
10 CTGGCCAGCG CTTCAA 



PCT/CA00/O1150 

AGACCAAGGT CACCGACATT^ 550 

CGCGCTGGTG ACAACTCTGG 600 

CGTCCGCCGC GGAATGATCA 650 

ACCAGTTCTT GGTGTCCATG 700 

CGTACTGGCT TCGGCTCCAA 750 

TGGTAAGTCA AGCCTTTTGC 800 

GTTGTATCCT AGATGAGGCT 850 

TCCCGGAGGG TGATGCCTGG 900 

CCGCCCCATT GCTGCTGAGG 950 

966 



2) INFORMATION FOR SEQ ID NO: 908 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Alternaria alternata 

(B) STRAIN: ATCC 62099 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 908 

30 GGTGCTATCA TCGTCGTTGC TGCTTCCGAT GGTCAGATGC CCCAGACCCG 50 

TGAGCACTTG CTGCTCGCCC GTCAGGTCGG TGTTCAGAAG ATCGTTGTCT 100 

TCGTCAACAA GGTCGATGCT GTCGAAGACC CGGAGATGTT GGAACTCGTC 150 

GAGATGGAGA TGCGTGAGTT ACTCACCAGC TACGGCTTCG AGGGCGACGA 200 

GACACCCATC ATCATGGGTT CCGCTCTATG CGCCATCGAG GGCCGCCAGC 250 

3 5 CCGAGATCGG TGTTACCAAG GTCGACGAGC TAATGGACGC TGTCGACTCA 300 

TGGATCCCCA CCCCTCAGCG TGAGACCGAG AAGCCTTTCC TCATGGCTGT 350 

TGAGGATGTC TTCTCGATTG CTGGACGTGG TACCGTCGTT TCGGGCCGTG 400 

TCGAGCGCGG < TATCTTGAAG CGTGACGCTG AAGTCGAGCT TGTCGGCAAG 450 

GGCACCGCGC CAATCAAGAC CAAGGTCACT GATATTGAGA CCTTCAAGAA 500 

40 GTCGTGCGAG GAGTCGCGCG CGGGTGATAA CTCCGGTCTT CTCCTCCGTG 550 
GTGTCAAGCG TGATGACGTT CGCCGCGGTA TGGTTGTTTC CGTTCCCGGA . 600 

CAAGTCAAGG CTCACAAGAA GTTCCTTGTC TCCATGTACG TTCTAAGCAA 650 

AGAGGAGGGT GGTCGTCACA CCGGCTTCGG CGAGAACTAC AGGCCGCAAA 700 

TGTTCATCCG AACTGCCGAT GAATCCTGCG CACTTCACTT CCCAGAGGGT 750 

45 ACCGAGGATG CGCACGACAA GCTAGTTATG CCCGGTGACA ACGTCGAGAT 8 00 

GGTCTGCGAA CTCCACCAGC CCCACGTTCT AGAGACCGGT CAGCG 845 



50 2) INFORMATION FOR SEQ ID NO: 909 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 931 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Penicillium marneffei ' 

(B) STRAIN: ATCC 64101 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 909 

5 

CGCTGTTGTT GTCGTCGCTG CTTCTGATGG TCAAATGTAA CATATCCACG 50 

AGCTGCCAAT TATGGACACT GCTGATAAGA ATAGGCCCCA AACCCGTGAG 100 

CACTTGCTCC TCGCCCGTCA GGTCGGTGTT CAAAAGATCG TCGTCTTCGT 150 

CAACAAGGTT GATGCCGTCG AGGACCCCGA GATGTTGGAA CTTGTCGAAT 200 

10 TGGAAATGCG TGAACTCTTG ACCACCTACG GTTTCGAGGG TGAAGAGACC 250 

CCTATCATTT TCGGATCCGC TCTTTGCGCC TTGGAAGGCC GCAAGCCCGA 3 00 

GATTGGCGAA CAGAAGATTG ACGAGCTCAT GAACGCCGTT GATACCTGGA 350 

TCCCCACCCC CCAGCGTGAC CTTGACAAGC CCTTCTTGAT GTCCGTTGAG 400 

GAAGTTTTCT CCATCTCTGG TCGTGGTACC GTTGCATCTG GTCGTGTTGA 450 

15 GCGTGGTATT TTGCGCAAGG ATTCTGAGGT TGAGATTATC GGATACCAGA 500 

AGAACCCTAT CAAGACCAAG GTTACCGACA TTGAGACCTT CAAGAAGTCT 550 

TGCGATGAAT CTCGTGCTGG TGACAACTCT GGCTTGCTTC TCCGTGGTAT 600 

CAAGCGTGAG GACATTCGTC GTGGTATGGT TATCGCTGCT CCTGGAACCA 650 

CCAAGGCTCA TGACAACTTC TTGGTCTCCA TGTATGTCTT GACTGAGGCT 700 

20 GAAGGTGGTC GTCGTACTGG ATTCGGCGCC AACTACCGTC CTCAAGCTTT 750 

CATCCGTACT GCCGGTATGT TCCCTTTCAA AGTCAATTAA TGAGCGATTT 800 

GCTAACGAGT TATAGATGAG GCTGCTACTC TCAGCTTCCC CGGTGACGAT 850 

CAGTCCAAGC AGGTCATGCC CGGTGACAAC GTTGAGATGA TCTTGAAGAC 900 

ACACCGTCCC GTTGCCGCCG AAGCTGGTCA G 931 

25 



2) INFORMATION FOR SEQ ID NO: 910 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 931 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium marneffei 
40 (B) STRAIN: ATCC 58950 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 910 

CGCTGTTGTT GTCGTCGCTG CTTCTGATGG TCAAATGTAA CATATCCACG 50 

4 5 AGCTGCCAAT TATGGACACT GCTGATAAGA ATAGGCCCCA AACCCGTGAG 100 

CACTTGCTCC TCGCCCGTCA GGTCGGTGTT CAAAAGATCG TCGTCTTCGT 150 

CAACAAGGTT GATGCCGTCG AGGACCCCGA GATGTTGGAA CTTGTCGAAT 200 

TGGAAATGCG TGAACTCTTG ACCACCTACG GTTTCGAGGG TGAAGAGACC 250 

CCTATCATTT TCGGATCCGC TCTTTGCGCC TTGGAAGGCC GCAAGCCCGA 300 

50 GATTGGCGAA CAGAAGATTG ACGAGCTCAT GAACGCCGTT GATACCTGGA 350 

TCCCCACCCC CCAGCGTGAC CTTGACAAGC CCTTCTTGAT GTCCGTTGAG 400 

GAAGTTTTCT CCATCTCTGG TCGTGGTACC GTTGCATCTG GTCGTGTTGA 450 

GCGTGGTATT TTGCGCAAGG ATTCTGAGGT TGAGATTATC GGATACCAGA 500 

AGAACCCTAT CAAGACCAAG GTTACCGACA TTGAGACCTT CAAGAAGTCT 550 

55 TGCGATGAAT CTCGTGCTGG TGACAACTCT GGCTTGCTTC TCCGTGGTAT 600 

CAAGCGTGAG GACATTCGTC GTGGTATGGT TATCGCTGCT CCTGGAACCA 650 

CCAAGGCTCA TGACAACTTC TTGGTCTCCA TGTATGTCTT GACTGAGGCT 700 

GAAGGTGGTC GTCGTACTGG ATTCGGCGCC AACTACCGTC CTCAAGCTTT 750 

CATCCGTACT GCCGGTATGT TCCCTTTCAA AGTCAATTAA TGAGCGATTT 800 

60 GCTAACGAGT TATAGATGAG GCTGCTACTC TCAGCTTCCC CGGTGACGAT 850 

512 
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900* 
931 



2 ) INFORMATION FOR SEQ ID NO: 911 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 911 
GACGGMKKCA TGCCGCARAC 2 



2) INFORMATION FOR SEQ ID NO: 912 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 912 
GACGGCGKCA TGCCGCARAC 2 



2) INFORMATION FOR SEQ ID NO: 913 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 913 
GACGGYSYCA TGCCKCAGAC 2 



2) INFORMATION FOR SEQ ID NO: 914 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 914 
GAARAGCTGC GGRCGRTAGT G 



2) INFORMATION FOR SEQ ID NO: 915 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 915 
AAACGCCTGA GGRCGGTAGT T 



2) INFORMATION FOR SEQ ID NO: 916 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 916 
GCCGAGCTGG CCGGCTTCAG 



2) INFORMATION FOR SEQ ID NO: 917 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 917 
TCGTGCTACC CGTYGCCGCC AT 



2) INFORMATION FOR SEQ ID NO: 918 

(i) SEQUENCE CHARACTERISTICS: 

5U 
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(A) LENGTH: 13 91 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(E) ACCESSION NUMBER: J01672 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 918 

AGAGAAGCCT GTCGGCACCG TCTGGTTTGC TTTTGCCACT GCCCGCGGTG 50 

AAGGCATTAC CCGGCGGGAT GCTTCAGCGG CGACCGTGAT GCGGTGCGTC 100 

GTCAGGCTAC TGCGTATGCA TTGCAGACCT TGTGGCAACA ATTT CTACAA 150 

AACACTTGAT ACTGTATGAG CATACAGTAT AATTGCTTCA ACAGAACATA 200 

TTGACTATCC GGTATTACCC GGCATGACAG GAGTAAAAAT GGCTATCGAC 250 

GAAAACAAAC AGAAAGCGTT GGCGGCAGCA CTGGGCCAGA TTGAGAAACA 3 00 

ATTTGGTAAA GGCTCCATCA TGCGCCTGGG TGAAGACCGT TCCATGGATG 350 

TGGAAACCAT CTCTACCGGT TCGCTTTCAC TGGATATCGC GCTTGGGGCA 400 

GGTGGTCTGC CGATGGGCCG TATCGTCGAA ATCTACGGAC CGGAATCTTC 450 

CGGTAAAACC ACGCTGACGC TGCAGGTGAT CGCCGCAGCG CAGCGTGAAG 500 

GTAAAACCTG TGCGTTTATC GATGCTGAAC ACGCGCTGGA CCCAATCTAC 550 

GCACGTAAAC TGGGCGTCGA TATCGACAAC CTGCTGTGCT CCCAGCCGGA 600 

CACCGGCGAG CAGGCACTGG AAATCTGTGA CGCCCTGGCG CGTTCTGGCG 650 

CAGTAGACGT TATCGTCGTT GACTCCGTGG CGGCACTGAC GCCGAAAGCG 700 

GAAATCGAAG GCGAAATCGG CGACTCTCAC ATGGGCCTTG CGGCACGTAT 750 

GATGAGCCAG GCGATGCGTA AGCTGGCGGG TAACCTGAAG CAGTCCAACA 800 

CGCTGCTGAT CTTCATCAAC CAGATCCGTA TGAAAATTGG TGTGATGTTC 850 

GGTAACCCGG AAACCACTAC CGGTGGTAAC GCGCTGAAAT TCTACGCCTC 900 

TGTTCGTCTC GACATCCGTC GTATCGGCGC GGTGAAAGAG GGCGAAAACG 950 

TGGTGGGTAG CGAAACCCGC GTGAAAGTGG TGAAGAACAA AATCGCTGCG 1000 

CCGTTTAAAC AGGCTGAATT CCAGATCCTC TACGGCGAAG GTATCAACTT 1050 

CTACGGCGAA CTGGTTGACC TGGGCGTAAA AGAGAAGCTG ATCGAGAAAG 1100 

CAGGCGCGTG GTACAGCTAC AAAGGTGAGA AGATCGGTCA GGGTAAAGCG 1150 

AATGCGACTG CCTGGCTGAA AGATAACCCG GAAACCGCGA AAGAGATCGA 12 00 

GAAGAAAGTA CGTGAGTTGC TGCTGAGCAA CCCGAACTCA ACGCCGGATT 1250 

TCTCTGTAGA TGATAGCGAA GGCGTAGCAG AAAC TAACGA AGATTTTTAA 13 00 

TCGTCTTGTT TGATACACAA GGGTCGCATC TGCGGCCCTT TTGCTTTTTT 13 50 

AAGTTGTAAG GATATGCCAT GACAGAATCA ACATCCCGTC G 13 91 



2) INFORMATION FOR SEQ ID NO: 919 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 919 
GGICCIGART CITMIGGIAA RAC 
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2) INFORMATION FOR SEQ ID NO: 920 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92 0 
TCICCVATIT CICCITCIAI YTC 



2) INFORMATION FOR SEQ ID NO: 921 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 921 
TIYRTIGAYG CIGARCAIGC 



2) INFORMATION FOR SEQ ID NO: 922 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 922 
TARAAYTTIA RIGCIYKICC ICC 



2) INFORMATION FOR SEQ ID NO: 923 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 923 
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GACGCIGCCA TCCTGATGAT C 
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21 



5 2) INFORMATION FOR SEQ ID NO: 924 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 924 

ACCTCAGTCG TCACGTTGGC G 21 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 925 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 925 
AAGCAGATGG TTGTGTGCTG 20 



2) INFORMATION FOR SEQ ID NO: 926 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 926 
CAGCTGCTCG TGGTGCATCT CGAT 24 

50. 



2) INFORMATION FOR SEQ ID NO: 92 7 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

60 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 927 
ACGCGGAGAA GGTGCGCTT 



2) INFORMATION FOR SEQ ID NO: 928 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 928 
GGTCGTTCTT CGAGTCACCG CA 



2) INFORMATION FOR SEQ ID NO: 929 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides 

(B) STRAIN: ATCC 25285 

(xi) SEQUENCE DESCRIPTION: SEQ 

TTCAGCATGC CATTTCAAAA CAGGCCGAAG 

GCTTGTGGGG AGCGTGCAAA TGAAGTTGTG 

GGAATTGGTG GACCCGCACA CGGGACGTAA 

TTATCGCAAA TACATCGAAC ATGCCGGTAG 

TATACGGCCA TGACGATTGC CGAATACTAT 

CCTGCTGATG GCAGACTCCA CTTCCCGTTG 

TGTCGAACCG TATGGAGGAG TTGCCCGGAC 
CTGTCCTCAA TCATTTCTAA CTTCTATGGC 

GAATAACGGC GAGAGCGGTT CTATTACCTT 



fragilis 



ID NO: 929 

CCGATATCGT GATTATCGCT 50 

GAAATCTTTA CCGAATTTCC 100 

GCTGATGGAG CGTACCATTA 150 

CAGCGCGTGA AGCTTCTGTG 200 

CGTGCCATGG GATTGAAAGT 250 

GGCGCAGGCA TTGCGTGAGA 300 

CGGATGCATT CCCGATGGAC 350 

CGTGCAGGCT ACGTGAAACT 400 

TATCGGTACA GTATCACC 448 



2) INFORMATION FOR SEQ ID NO: 930 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides distasonis 

(B) STRAIN: ATCC 8503 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 930 

GCTATCTCTA AACAAGCGGA AGCGGATATC GTGATTATCG CCGCCTGCGG 50 

TGAGCGTGCG AATGAGGTCG TAGAGGTATT TACGGAGTTC CCGGAATTGG 100 

TAGACCCGCA TACGGGACGT AAATTGATGG AACGTACGAT CATTATCGCC 150 

AATACATCCA ACATGCCGGT AGCCGCTCGT GAGGCATCCG TATATACGGC 200 

GATGACCATC GCCGAGTATT ATCGCAGCAT GGGTTTGAAG GTTCTGTTGA 250 

TGGCCGACTC TACTTCCCGC TGGGCACAGG CTTTGCGTGA GATGTCCAAC 300 

CGTTTGGAGG AGTTGCCGGG ACCGGATGCT TTCCCGATGG ACTTGTCCGC 350 

TATCGTGGCG AACTTCTACG CTCGTGCGGG ATTCGTTCAT TTGAATAACA 400 

ACGCTACAGG CTCCGTCACT TTCATCGGTA CGGTATCG 438 



2) INFORMATION FOR SEQ ID NO: 931 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas asaccharolytica 

(B) STRAIN: ATCC 25260 

(xi) SEQUENCE DESCRIPTION: SEQ 

CCTCCAGCAC GCTATCTCTA AGCAGGCGGA 
CAGCCTGCGG TGAGCGTGCT AATGAGGTGG 
CCTGAGCTCG AAGACCCACA CACGGGACGC 
CATCATCGCT AACACGAGTA ACATGCCAGT 
TCTACACCGC TATGACCATC GCTGAGTACT 
GTACTCCTAA TGGCTGACTC GACCTCTCGC 
GATGTCTAAC CGTCTAGAGG AGCTGCCTGG 
ACTTGTCGGC TATCGTGGCA AACTTCTACG 
CTCAACAACG GTGAGACAGG TTCTGTAACC 
AGC 



ID NO: 931 

GGCTGATATC GTCATTATGG 50 

TGGAGATCTT TGCCGAGTTC 100 

AAGCTGATGG AGCGTACGAT 150 

GGCTGCTCGT GAGGCTTCGG 2 00 

ACCGCTCGAT GGGTCTCAAA 250 

TGGGCACAGG CACTGCGTGA 3 00 

ACCAGATGCA TTCCCGATGG 3 50 

CTCGTGCCGG CTTCGTCTAT 400 

TTCATCGGTA CGGTCTCTCC 450 

453 



2) INFORMATION FOR SEQ ID NO: 932 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Listeria monocytogenes 

(B) STRAIN: ATCC 15313 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 932 

CGGAGCTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCACAAACTC 50 

GTGAACATAT CTTACTTTCA CGTCAAGTTG GTGTTCCATA CATCGTTGTA 100 

TTCATGAACA AATGTGACAT GGTTGACGAT GAAGAATTAC TAGAATTAGT 150 

TGAAATGGAA ATTCGTGATC TATTAACTGA ATATGAATTC CCTGGCGATG 200 

ACATTCCTGT AATCAAAGGT TCAGCTCTTA AAGCACTTCA AGGTGAAGCT 250 

GACTGGGAAG CTAAAATTGA CGAGTTAATG GAAGCTGTAG ATTCTTACAT 300 

TCCAACTCCW GAACGTGATA CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

ATGTATTCTC AATCACTGGT CGTGGAACAG TTGCAACTGG ACGTGTTGAA 400 

CGTGGACAAG TTAAAGTTGG TGACGAAGTA GAAGTTATCG GTATCGAAGA 450 

AGAAAGCAAA AAAGTAGTAG TAACTGGAGT AGAAATGTTC CGTAAATTAC 500 

TAGACTACGC TGAAGCTGGC GACAACATTG GCGCACTTCT ACGTGGTGTT 550 

GCTCGTGAAG ATATCCAACR TGGTCAAGTA TTAGCTAAAC CAGGTTCGAT 600 

TACTCCACAC ACTAACTTCA AAGCTGAAAC TTATGTTTTA ACTAAAGAAG 650 

AAGGTGGACG TCACACTCCA TTCTTCAACA ACTACCGCCC ACAATTCTAT 700 

TTCCGTACTA CTGACGTAAC TGGTATTGTT ACACTTCCAG AAGGTACTGA 750 

AATGGTAAYG CCTGGTGATA ACATTGAGCT TGCAGTTGAA CTAATTGCAC 800 

CAATCGCTAT CGAAGACGGT ACTAAATTCT CTATC 835 



2) INFORMATION FOR SEQ ID NO: 933 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 933 
CATCATCGTI TTCMTGAACA ARTG 



2) INFORMATION FOR SEQ ID NO: 934 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 934 
TCACGYTTRR TACCACGCAG IAGA 



2) INFORMATION FOR SEQ ID NO: 935 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 5 
GGIAARWSIC ARYTITGYCA YAC 



2) INFORMATION FOR SEQ ID NO: 936 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 
(■C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 936 
TCISIYTCIG GIARRCAIGG 



2) INFORMATION FOR SEQ ID NO: 937 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 937 
ATIACIGARG YITTYGGIGA RTT 



2) INFORMATION FOR SEQ ID NO: 938 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 8 
CYIGTIGYIS WIGCRTGIGC 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1203 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 
(C) ACCESSION NUMBER: D10023 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 9 

ATGTCTCAAG TTCAAGAACA ACATATATCA GAGTCACAGC TTCAGTACGG 50 

GAACGGTTCG TTGATGTCCA CTGTACCAGC AGACCTTTCA CAGTCAGTCG 100 

TTGATGGAAA CGGCAACGGT AGCAGCGAAG ATATTGAGGC CACCAACGGC 150 

20 TCCGGCGATG GTGGCGGATT GCAGGAGCAA GCGGAAGCGC AAGGTGAAAT 200 

GGAGGATGAA GCATACGATG AAGCTGCCTT AGGTTCGTTT GTGCCAATAG 250 

AAAAACTGCA AGTGAACGGG ATTACTATGG CGGATGTGAA AAAAC T AAGG 300 

GAGAGTGGGC TTCACACTGC TGAAGCGGTA GCATATGCTC CCAGAAAGGA 3 50 

TTTATTGGAA ATCAAAGGTA TATCGGAAGC TAAGGCAGAT AAGTTGCTAA 4 00 

25 ACGAAGCGGC AAGGCTAGTG CCTATGGGAT TTGTCACGGC TGCTGATTTT 450 

CATATGAGAA GATCGGAGCT GATTTGTTTG ACAACGGGTT CTAAGAATTT 500 

GGACACTCTT TTGGGTGGTG GTGTGGAAAC TGGTTCTATT ACTGAGCTTT 550 

TCGGTGAATT CAGGACAGGT AAGTCCCAGC TATGTCACAC TTTGGCCGTG 600 

AC ATGC CAAA TTCCATTGGA TATTGGTGGC GGTGAAGGTA AGTGTTTGTA 650 

3 0 TATCGATACC GAAGGTACTT TCAGGCCGGT AAGATTGGTA TCCATAGCTC 700 

AGCGGTTCGG ATTAGACCCG GATGATGCTT TGAACAACGT TGCGTATGCA 750 

AGAGCCTATA ACGCCGATCA TCAGTTAAGA CTTCTGGATG CTGCTGCCCA 800 

AATGATGAGC GAGTCTCGGT TTTCCTTGAT TGTGGTCGAT TCTGTTATGG 850 

CTCTATACCG TACGGATTTT TCTGGTCGTG GTGAACTAAG CGCAAGGCAA 900 

35 ATGCATTTAG CCAAATTTAT GCGTGCTTTG CAAAGGCTGG CCGACCAATT 950 

TGGTGTTGCA GTCGTCGTTA CTAACCAAGT GGTCGCCCAA GTTGATGGTG 1000 

GTATGGCTTT TAATCCAGAT CCAAAGAAGC CTATCGGTGG TAATATTATG 1050 

GCACATTCTT CCACCACGCG ATTAGGTTTC AAAAAGGGTA AGGGATGTCA 1100 

AAGATTATGC AAAGTTGTTG ACTCACCTTG CTTACCAGAG GCTGAATGTG 1150 

40 TGTTCGCGAT CTATGAAGAT GGTGTTGGTG ACCCCAGAGA AGAAGACGAG 12 00 

TAG 1203 



45 2) INFORMATION FOR SEQ ID NO: 940 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1800 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(B) STRAIN: GRF88 

(C) ACCESSION NUMBER: M87549 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 940 
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CGATCCAATT GCTGGTCTTA AGATGCATTT GATTGATCTA GGTATTGCC^ 50 

CTGAAGCTGA AGTCAAAGCT TACGACAAGT CCGCTAGAAA ATACGTTGAC 100 

GAACAAGTTG AATTAGCTGA TGCTGCTCCT CCTCCAGAAG CCAAATTATC 150 

5 CATCTTGTTT GAAGACGTCT ACGTGAAAGG TACAGAAACT CCAACCCTAA 2 00 

GAGGTAGGAT CCCTGAAGAT ACTTGGGACT TCAAAAAGCA AGGTTTTGCC 2 50 

TCTAGGGATT AATTAAATCG TAAGGAAAAA TAAAATAATA GTGCTGTGAT 3 00 

CGCATGATAT TCTTCCCTGG AAGCGCCATT TTATAGCAAG AAATGTAAGT 350 

CAAGTATATT TTAACTGTAT ATACAACAAT ATGACTCTTT TTTATGCCTT 4 00 

10 GTTGTTTTTC TTCGGGTTTT CCCACACATT GTGTGGAGAG ATAGTTATTA 450 

ACAGACCGAA AATAGCCGCC CAAGGATAAA CTTTTATATA AAGGGAAGGG 500 

TAGTTGACCC AAAAATTTGG ATTCTACTTT CCAGATTTAC TTTCACCCTT 550 

TTATATTTGC TGTAGTCTGT TATGCCAATC AGGAAAGCAT TTGAACAAAT 600 

ATGTCTGTTA CAGGAACTGA GATCGATAGT GATACAGCAA AAAATATTCT 650 

15 TAGTGTAGAT GAATTACAGA ACTATGGTAT TAATGCCTCA GATCTTCAAA 7 00 

AATTGAAGTC TGGTGGGATA TACACAGTCA ATGTATGTTA TAATAACATT 750 

TTTAAAACCT CTGCTGTAGA GGTTCTTTCC CCCTTTCTTT TACTAACTAA 800 

TAATTTGGAA AGGAACTTTT ATAGACCGTT TTGTCAACAA CAAGAAGACA 850 

TCTATGTAAA ATTAAAGGGT TAAGTGAGGT GAAAGTGGAA AAAATTAAAG 900 

20 AAGCTGCTGG AAAAATCATA CAGGTGGGAT TTATCCCCGC TACTGTACAA 950 

TTAGATATAA GACAGCGTGT GTACTCCTTA TCAACTGGAT CTAAGCAACT 1000 

AGATTCAATC CTAGGTGGTG GAATAATGAC AATGAGTATC ACTGAAGTAT 1050 

TTGGTGAATT TAGGTGTGGT AAGACACAGA TGTCTCATAC TTTGTGTGTT 110 0 

ACCACGCAGC TTCCGAGGGA AATGGGTGGT GGTGAAGGGA AAGTAGCATA 1150 

25 TATTGATACA GAAGGCACTT TCAGGCCCGA GAGGATTAAG CAAATTGCAG 1200 

AAGGTTATGA ATTGGATCCC GAGTCATGTT TGGCAAACGT TTCATATGCT 12 50 

AGAGCCTTGA ATAGTGAACA TCAAATGGAA CTTGTTGAAC AATTGGGTGA 13 00 

AGAACTTAGT TCTGGAGATT ATCGCCTTAT CGTGGTAGAT TCTATAATGG 13 50 

CAAACTTCAG AGTAGACTAC TGCGGTAGAG GTGAACTAAG CGAAAGACAG 14 00 

30 CAAAAGCTAA ATCAACATCT TTTCAAATTG AATAGATTGG CAGAGGAATT 1450 

TAATGTTGCA GTATTTCTGA CAAAC C AAGT TCAATCAGAC CCAGGTGCTT 1500 

CTGCATTATT TGCCTCGGCA GATGGTAGGA AACCAATTGG AGGGCACGTT 1550 

CTGGCACATG CGTCAGCAAC AAGGATTTTG TTGAGAAAAG GGCGTGGTGA 1600 

CGAAAGAGTT GCCAAGTTAC AAGATTCCCC AGATATGCCT GAAAAAGAAT 1650 

35 GTGTCTACGT AATTGGTGAA AAAGGTATTA CCGATTCAAG TGACTAGTTT 1700 

TTGTATACTT TTTTAATGAA GATGACATTG CTCCTTTATT AAACTTTTCT 1750 

TTTACTTTGT GTTACTAATA TTATTAATAT CTTGTTATGA TTCTTTGTTT 1800 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 430 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus humicolus 

(B) STRAIN: ATCC 38294 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 941 

CGTCCTTATC CAGGAGCTCA TTAACAACAT TGCCAAGGCC CACGGTGGTT 50 

TCTCCGTCTT CACCGGTGTC GGTGAGCGTA CCCGTGAGGG TAACGACCTG 100 

TACCACGAGA TGCGTGAGAC TGGTGTCATC AACCTCGAGG GCGACTCCAA 150 

GGTCGCTCTC GTCTTCGGCC AGATGAACGA GCCCCCCGGA GCCCGTGCCC 200 

GTGTCGCCCT TACCGGCCTC ACCATCGCCG AGTACTTCCG TGACGAGGAG 250 

GGTCAGGACG TGCTTCTCTT CATCGACAAC ATTTTCCGTT TCACCCAGGC 3 00 

CGGTTCCGAG GTGTCTGCCC TTCTCGGTCG TATCCCCTCG GCCGTCGGTT 3 50 

ACCAGCCCAC CCTCGCTACC GACATGGGTT CCATGCAGGA GCGTATCACC 400 

ACCACCAAGA AGGGTTCGAT TACCTCCGTC 430 



2) INFORMATION FOR SEQ ID NO: 942 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 794 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 43895 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 942 

CGTGTACGAT GCTCTTGAGG TGCAAAATGG TAATGAGCGT CTGGTGCTGG 50 

AAGTTCAGCA GCAGCTCGGC GGCGGTATCG TGCGTACCAT CGCAATGGGT 100 

TCCTCCGACG GTCTGCGTCG CGGTCTGGAT GTAAAAGACC TCGAACACCC 150 

GATCGAAGTC CCGGTAGGTA AAGCGACTCT GGGCCGTATC ATGAACGTAC 200 

TGGGTGAACC GGTCGACATG AAAGGCGAGA TCGGTGAAGA AGAGCGTTGG 250 

GCGATTCACC GCGCAGCACC TTCCTACGAA GAGCTGTCAA ACTCTCAGGA 300 

ACTGCTGGAA ACCGGTATCA AAGTTATCGA CCTGATGTGT CCGTTCGCTA 350 

AGGGCGGTAA AGTTGGTCTG TTCGGTGGTG CGGGTGTAGG TAAAACCGTA 400 

AACATGATGG AGCTCATTCG TAACATCGCG ATCGAGCACT CCGGTTACTC 450 

TGTGTTTGCG GGCGTAGGTG AACGTACTCG TGAGGGGAAC GACTTCTACC 500 

ACGAAATGAC CGACTCCAAC GTTATCGATA AAGTATCCCT GGTGTATGGC 550 

CAGATGAACG AGCCGCCGGG AAACCGTCTG CGCGTAGCTC TGACCGGTCT 600 

GACCATGGCT GAGAAATTCC GTGACGAAGG TCGTGACGTT CTGCTGTTCG 650 

TTGACAACAT CTATCGTTAC ACCCTGGCCG GTACGGAAGT ATCCGCACTG 700 

CTGGGCCGTA TGCCTTCAGC GGTAGGTTAT CAGCCGACCC TGGCGGAAGA 750 

GATGGGCGTT CTGCAGGAAC GTATCACCTC CACCAAAACC GGTT 794 
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2 ) INFORMATION FOR SEQ ID NO: 943 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 <D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 35401 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 943 

20 ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTGCAAAA TGGTAATGAG 50 

CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA TCGTACGTAC 100 

CATCGCAATG GGTTCCTCCG ACGGTCTGCG TCGCGGTCTG GATGTAAAAG 150 

ACCTCGAACA CCCGATTGAA GTCCCGGTAG GTAAAGCGAC TCTGGGCCGT 2 00 

ATCATGAACG TACTGGGTGA ACCGGTCGAC ATGAAAGGCG AGATCGGTGA 250 

25 AGAAGAGCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAC GAAGAGCTGT 3 00 

CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT CGACCTGATG 350 

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCGGGTGT 400 

AGGTAAAACC GTAAACATGA TGGAGCTCAT TCGTAACATC GCGATCGAGC 450 

ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT 500 

30 AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG ACAAAGTATC 550 

CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600 

CTCTGACCGG TCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

GTTCTGCTGT TCGTTGACAA CATCTATCGT TACACCCTGG CCGGTACGGA 700 

AGTATC CGC A CTGCTGGGCC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 750 

35 CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC CTCCACCAAA 800 

ACTGGTTCTA TCAC 814 



40 2 ) INFORMATION FOR SEQ ID NO: 944 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 11775 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 944 



GTGTACGATG CTCTTGAGGT GCAAAATGGT AATGAGCGTC TGGTGCTGGA 50 

AGTTCAGCAG CAGCTCGGCG GCGGTATCGT GCGTACCATC GCAATGGGTT 100 

CCTCCGACGG TCTGCGTCGC GGTCTGGATG TAAAAGACCT CGAACACCCG 150 

ATCGAAGTCC CGGTAGGTAA AGCGACTCTG GGCCGTATCA TGAACGTACT 200 

60 GGGTGAACCG GTCGACATGA AAGGCGAGAT CGGTGAAGAA GAGCGTTGGG 2 50 
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CGATTCACCG CGCAGCACCT TCCTACGAAG AGCTGTCAAA CTCTCAGGAA 3 0/0 

CTGCTGGAAA CCGGTATCAA AGTTATCGAC CTGATGTGTC CGTTCGCTAA 350 

GGGCGGTAAA GTTGGTCTGT TCGGTGGTGC GGGTGTAGGT AAAACCGTAA 4 00 

ACATGATGGA GCTTATTCGT AACATCGCGA TCGAGCACTC CGGTTACTCT 4 50 

5 GTGTTTGCGG GCGTAGGTGA ACGTACTCGT GAGGGTAACG ACTTCTACCA 500 

CGAAATGACC GACTCCAACG TTATCGACAA AGTATCCCTG GTGTATGGCC 550 

AGATGAACGA GCCGCCGGGA AACCGTCTGC GCGTTGCTCT GACCGGTCTG 600 

ACCATGGCTG AGAAATTCCG TGACGAAGGT CGTGACGTTC TGCTGTTCGT 650 

TGACAACATC TATCGTTACA CCCTGGCCGG TACGGAAGTA TCCGCACTGC 700 

10 TGGGCCGTAT GCCTTCAGCG GTAGGTTATC AGCCGACCCT GGCGGAAGAG 750 

ATGGGCGTTC TGCAGGAACG TATCACCTCC ACCAAAACCG GTTCTATC 798 



15 2) INFORMATION FOR SEQ ID NO: 945 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 25922 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 945 



30 



ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTGCAAAA TGGTAATGAG 50 

CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA TCGTGCGTAC 100 

CATCGCAATG GGTTCCTCCG ACGGTCTGCG TCGCGGTCTG GATGTAAAAG 150 

ACCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCGAC TCTGGGCCGT 200 

3 5 ATCATGAACG TACTGGGTGA ACCGGTCGAC ATGAAAGGCG AGATCGGTGA 250 

AGAAGAGCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAC GAAGAGCTGT 3 00 

CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT CGACCTGATG 350 

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCGGGTGT 400 

AGGTAAAACC GTAAACATGA TGGAGCTTAT TCGTAACATC GCGATCGAGC 450 

40 ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT 500 

AACGACTTCT AC C ACGAAAT GACCGACTCC AACGTTATCG ACAAAGTATC 550 

CTTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600 

CTCTGACCGG TCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

GTATTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG CCGGTACCGA 700 

45 AGTATCCGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 750 

CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC CTCCACCAAA 800 

ACCGGTTCTA TC 812 



50 

2) INFORMATION FOR SEQ ID NO: 946 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 832 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

60 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria polysaccharea 

(B) STRAIN: ATCC 43768 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 6 

GCGACGCTAT CCCGCATGTT TACGATGCCC TGAAATTGGA CGAGAACGGT 50 

CTGACTCTGG AAGTTCAACA ACTTCTGGGT GACGGCGTTG TCCGTACTAT 100 

TGCAATGGGT AGTTCAGACG GCCTGAAACG CGGCATGTCT GTAAGCAATA 150 

CTGGTGCGCC AATCACTGTG CCGGTAGGTA AAGGTACTTT GGGTCGTATT 2 00 

GTCGACGTAT TGGGTACGCC TGTTGATGAA GCAGGTCCGA TCGATACCGA 250 

CAAGAGCCGT GCCATTCACC AAACTGCTCC GAAATTCGAC GAGTTGTCTG 300 

CAACTACCGA ATTGTTGGAA ACCGGTATTA AAGTGATCGA CTTGCTGTGT 350 

CCGTTTGCTA AAGGCGGTAA AGTAGGTCTG TTCGGTGGTG CCGGTGTAGG 4 00 

CAAAACCGTG AACATGATGG AATTGATCAA CAACATCGCC AAAGCGCACA 450 

GCGGTCTGTC CGTGTTCGCA GGTGTGGGCG AGCGTACCCG TGAAGGTAAC 500 

GACTTCTACC ACGAGATGAA AGATTCCAAC GTATTGGATA AAGTGGCAAT 550 

GGTTTACGGT CAGATGAACG AACCTCCGGG CAACCGTTTG CGCGTCGCAT 600 

TGACCGGTTT GACCATGGCG GAATACTTCC GTGACGAAAA AGACGAAAAC 650 

GGTAAAGGTC GCGACGTATT GTTCTTCGTT GACAACATCT ACCGTTACAC 700 

TCTGGCCGGT ACCGAAGTAT CTGCACTGTT GGGCCGTATG CCTTCTGCAG 7 50 

TGGGTTACCA ACCGACATTG GCTGAAGAAA TGGGTCGTTT GCAAGAGCGT 800 

ATTACCTCTA CCCAAACCGG TTCCATTACT TC 832 



2) INFORMATION FOR SEQ ID NO: 947 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 840 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria sicca 

(B) STRAIN: ATCC 9913 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 7 

TCCGCGCGAT GCCATTCCGC ATGTTTACGA CGCCCTGAAA TTGGATGCAA 50 

ACGGCCTGAC TTTGGAAGTA CAACAGCTTC TGGGCGACGG CGTGGTTCGT 100 

ACTATTGCAA TGGGTAGTTC GGACGGTCTG AAACGCGGCA TGACTGTAAG 150 

CAATACAGAT GCGCCGATTA CTGTGCCGGT AGGTAAAGGT ACTTTGGGAC 200 

GTATTGTCGA TGTGTTGGGT ACACCTGTTG ATGAAGCAGG TCCGATTGAT 250 

ACCGACAAAC ACCGTGCTAT CCATCAGACA GCTCCGAAAT TCGATGAGTT 300 

GTCTGCTACT ACCGAGCTGC TGGAAACAGG CATTAAAGTG ATTGACTTGC 350 

TGTGTCCGTT TGCCAAAGGC GGTAAAGTAG GTCTGTTCGG TGGTGCCGGT 400 

GTAGGCAAAA CCGTCAACAT GATGGAATTG ATTAACAACA TCGCCAAAGC 450 

GCATAGTGGT TTGTCCGTGT TCGCCGGTGT GGGGGAACGT ACCCGTGAAG 500 

GTAACGACTT CTACCACGAG ATGAAAGATT CCAACGTATT GGACAAAGTG 550 

GCGATGGTTT ACGGTCAGAT GAACGAACCT CCGGGTAACC GTCTGCGTGT 600 

AGCCTTGACC GGTTTGACGA TGGCCGAATA CTTCCGTGAT GAAAAAGACG 650 

AAAGCGGCAA AGGTCGCGAC GTATTGTTCT TCGTGGACAA CATTTACCGT 7 00 

TACACTCTGG CCGGTACAGA AGTATCCGCA TTGCTCGGTC GTATGCCTTC 750 

AGCAGTAGGT TACCAACCGA CATTGGCTGA AGAAATGGGT CGTCTGCAAG 800 

AGCGTATTAC CCTCTACTCA AACAGGCTCC ATTACTTCTA 840 
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2) INFORMATION FOR SEQ ID NO: 948 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 843 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 903 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 948 

GCAGCTGGCG ACAAGCTACC TGAGATCAAT AATGCACTTG TAGTCTATAA 50 

AAATGACGAA AAAAAATCAA AAATCGTCCT TGAAGTAGCT CTTGAGCTTG 100 

GTGATGGAGT GGTTCGGACC ATCGCTATGG AATCAACGGA TGGGTTGACT 150 

CGTGGCATGG AAGTGCTAGA TACTGGCCGT CCAATTTCTG TGCCAGTCGG 2 00 

CAAAGAAACA CTTGGTCGCG TCTTTAACGT TTTGGGAGAT ACCATTGACT 2 50 

TGGATGCTCC TTTTGCGGAT GATGCAGAGC GCCAGCCAAT CCATAAGAAA 3 00 

GCTCCAACCT TTGATGAGTT GTCTACTTCT TCAGAGATCT TAGAGACAGG 3 50 

TATCAAGGTT ATCGACCTGT TAGCCCCTTA TCTGAAAGGT GGTAAAGTTG 400 

GACTCTTCGG TGGTGCCGGA GTTGGTAAGA CCGTCCTGAT TCAAGAATTG 450 

ATCCACAACA TTGCCCAAGA ACACGGTGGT ATTTCTGTAT TTACTGGCGT 500 

TGGGGAACGT ACCCGTGAAG GGAATGACCT TTATTGGGAA ATGAAAGAGT 550 

CTGGTGTTAT CGAGAAAACA GCCATGGTCT TCGGTCAGAT GAATGAGCCG 600 

CCAGGAGCGC GTATGCGGGT TGCTTTGACT GGTTTGACGA TTGCAGAATA 650 

CTTCCGTGAT GTGGAAGGTC AAGATGTCTT GCTCTTCATT GACAACATCT 700 

TCCGTTTCAC GCAGGCAGGT TCTGAAGTTT CTGCCCTTTT GGGTCGGATG 750 

CCGTCAGCCG TTGGTTACCA ACCAACACTT GCGACAGAAA TGGGGCAATT 800 

GCAAGAGCGT ATCACATCGA CTAAGAAGGG TTCTGTAACC TCT 843 



2) INFORMATION FOR SEQ ID NO: 949 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 49456 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 949 

GCAGCAGGGG AAAAACTTCC TGAGATTAAC AATGCACTTG TCGTCTACAA 50 

AAATGACGAA AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTGG 100 

GAGATGGTAT GGTCCGTACT ATCGCCATGG AATCAACAGA TGGTTTGACT 150 

CGTGGAATGG AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG 200 

TAAAGAAACT TTGGGACGTG TCTTCAATGT TTTGGGAGAT ACCATTGACT 2 50 

TGGAAGCTCC TTTTACAGAA GATGCAGAGC GTCAGCCAAT TCATAAAAAA 300 
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GCTCGAACTT 
GATTAAGGTT 
GACTTTTCGG 
ATTCACAACA 
TGGGGAACGT 
CAGGTGTTAT 
CCAGGAGCAC 
CTTCCGTGAT 
TCCGTTTCAC 
CCATCAGCCG 
GCAAGAGCGT 



TTGATGAATT 
ATCGACCTTC 
TGGTGCCGGA 
TTGCCCAAGA 
ACTCGTGAGG 
CGAGAAAACA 
GTATGCGTGT 
GTGGAAGGCC 
TCAGGCTGGT 
TTGGTTACCA 
ATTACATCAA 



GTCTACCTCT 
TTGCCCCTTA 
GTTGGTAAAA 
ACACGGTGGT 
GTAATGACCT 
GCCATGGTAT 
TGCCCTAACT 
AAGACGTGCT 
TCAGAAGTAT 
ACCAACACTT 
CTAAAAAGGG 



TCTGAAATCC TTGAAACAGG 
CCTTAAAGGT GGTAAGGTTG 
CCGTCTTAAT CCAAGAATTG 
ATTTCAGTAT TTACCGGTGT 
TTACTGGGAA ATGAAAGAAT 
TTGGTCAGAT GAATGAGCCG 
GGTTTGACAA TCGCCGAATA 
TCTCTTTATC GATAATATCT 
CTGCCCTTTT GGGTCGTATG 
GCTACGGAAA TGGGTCAATT 
TTCTGTAACC T 



350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
841 
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15 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 950 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: LSPQ 2583 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 950 

GCTACCTGAG AT CAATAATG CACTTGTAGT CTATAAAAAT GACGAAAATA 50 

AATCAAAAAT CGTCCTTGAA GTAGCTCTTG AGCTTGGTGA TGGAGTGGTT 100 

CGGACCATCG CTATGGAATC AACGGATGGG TTGACTCGTG GCATGGAAGT 150 

GCTAGATACT GGTCGTCCAA TTTCTGTGCC AGTCGGCAAA GAAACACTTG 2 00 

GTCGCGTCTT TAACGTTTTG GGAGATACCA TTGACTTGGA TGCTCCTTTT 250 

GCGGATGATG CAGAGCGCCA GCCAATCCAT AAGAAAGCTC CAACCTTTGA 3 00 

TGAGTTGTCT ACTTCATCAG AGATCTTAGA GACAGGTATC AAGGTTATCG 3 50 

ACCTGTTAGC ACCTTATCTG AAAGGTGGTA AAGTCGGACT CTTCGGTGGT 400 

GCCGGAGTTG GTAAGACCGT CCTGATTCAG GAATTGATCC ACAACATTGC 450 

CCAAGAGCAT GGTGGTATTT CCGTGTTTAC CGGTGTTGGG GAACGTACCC 500 

GTGAAGGGAA TGACCTTTAC TGGGAAATGA AGGAGTCTGG CGTTATCGAG 550 

AAAACAGCCA TGGTCTTCGG TCAGATGAAT GAGCCACCAG GAGCGCGTAT 600 

GCGGGTTGCT TTGACTGGTT TGACGATTGC AGAGTACTTC CGTGATGTAG 650 

AAGGTCAAGA TGTCTTGCTC TTCATTGACA ACATCTTCCG TTTCACGCAG 700 

GCAGGTTCTG AAGTCTCTGC CCTTTTGGGT CGGATGCCAT CAGCCGTTGG 750 

TTACCAACCA ACACTTGCGA CTGAAATGGG ACAACTCCAA GAGCGTATTA 800 

CATCGACTAA GAAAGGTTCT GTAACTT 82 7 



2 ) INFORMATION FOR SEQ ID NO : 951 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 



PCT/CA00/01150 



(xi) SEQUENCE DESCRIPTION: SEQ 



ID NO: 951 



GCAGCAGGGG 
AAATGACGAA 
GTGATGGTAT 
CGTGGAATGG 
TAAAGAAACT 
TGGATGCTCC 
GCTCCAACTT 
GATTAAGGTT 
GACTCTTCGG 
ATTCACAACA 
TGGAGAACGT 
CAGGCGTTAT 
CCTGGAGCAC 
CTTCCGTGAT 
TCCGTTTCAC 
CCTTCAGCCG 
GCAAGAACGT 



AAACACTTCC 
AGAAAAACAA 
GGTCCGTACG 
AAGTTTTGGA 
TTGGGACGTG 
TTTCGCTGAA 
TTGATGAATT 
ATCGACCTTC 
TGGTGCCGGA 
TTGCCCAAGA 
ACCCGTGAGG 
CGAGAAAACA 
GTATGCGTGT 
GTAGAAGGCC 
TCAAGCTGGT 
TTGGTTACCA 
ATCACATCAA 



TGAGATTAAT 
AAATCGTCCT 
ATCGCCATGG 
CACAGGCCGT 
TCTTCAACGT 
GACGCTGAGC 
GTCTACCTCA 
TTGCCCCTTA 
GTTGGTAAAA 
ACATGGTGGT 
GGAACGACCT 
GCCATGGTAT 
TGCTCTTACT 
AAGATGTGCT 
TCAGAAGTAT 
ACCAACACTT 
CTAAGAAGGG 



AATGCACTTG 
TGAAGTAGCC 
AATCAACAGA 
CCAATCTCTG 
TTTGGGAGAT 
GTCAGCCAAT 
TCTGAAATCT 
CCTTAAAGGT 
CTGTCTTGAT 
ATTTCAGTAT 
TTACTGGGAA 
TTGGTCAGAT 
GGTTTGACAA 
TCTCTTTATC 
CTGCCCTTTT 
GCTACGGAAA 
TTCTGTAACC 



TCGTCTACAA 
TTGGAGTTGG 
TGGTTTGACT 
TACCAGTAGG 
ACTATTGACT 
TCATAAGAAA 
TGGAAACAGG 
GGGAAAGTTG 
CCAAGAGTTG 
TTACCGGTGT 
ATGAAAGAAT 
GAATGAGCCA 
TCGCCGAATA 
GACAATATCT 
GGGTCGGATG 
TGGGTCAATT 
TCTA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
844 



2) INFORMATION FOR SEQ ID NO: 952 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 952 

GCAGCAGGGG AAAAACTTCC TGAGATTAAC AATGCACTTG TCGTCTACAA 50 

AAATGACGAA AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTAG 100 

GAGATGGTAT GGTTCGTACT ATCGCCATGG AATCAACAGA TGGGTTGACT 150 

CGTGGAATGG AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG 200 

TAAAGAAACT TTGGGACGTG TCTTCAACGT TTTGGGAGAT ACCATTGATT 250 

TGGAAGCTCC TTTTACAGAA GACGCAGAGC GTCAGCCAAT TCATAAAAAA 300 

GCTCCAACTT TTGATGAGTT GTCTACCTCT TCTGAAATCC TTGAAACAGG 350 

GATCAAGGTT ATTGACCTTC TTGCCCCTTA CCTTAAAGGT GGTAAAGTTG 400 

GACTTTTCGG TGGTGCCGGA GTTGGTAAAA CCGTCTTAAT CCAAGAATTG 450 

ATTCACAACA TTGCCCAAGA GCACGGTGGT ATTTCAGTAT TTACTGGTGT 500 

TGGGGAACGT ACTCGTGAGG GGAATGACCT TTACTGGGAA ATGAAAGAAT 550 

CAGGCGTTAT CGAGAAAACA GCCATGGTCT TTGGTCAGAT GAATGAGCCA 600 

CCAGGAGCAC GTATGCGTGT TGCCCTTACT GGTTTGACAA TCGCTGAATA 650 

CTTCCGTGAT GTGGAAGGCC AAGACGTGCT TCTCTTTATC GATAATATCT 700 

TCCGTTTCAC TCAGGCTGGT TCAGAAGTAT CTGCCCTTTT GGGTCGTATG 750 

CCATCAGCCG TTGGTTACCA ACCAACACTT GCTACGGAAA TGGGTCAATT 800 

GCAAGAACGT ATCACATCAA CCAAGAAGGG 83 0 
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2) INFORMATION FOR SEQ ID NO: 953 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
15 (B) STRAIN: StrR-11 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 953 

AAAAACTTCC TGAGATTAAC AATGCACTTG TCGTCTACAA AAATGACGAA 50 

2 0 AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTAG GAGATGGTAT 10 0 

GGTTCGTACT ATCGCCATGG AATCAACAGA TGGGTTGACT CGTGGAATGG 150 

AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG TAAAGAAACT 200 

TTGGGACGTG TCTTCAACGT TTTGGGAGAT ACCATTGATT TGGAAGCTCC 250 

TTTTACAGAA GACGCAGAGC GTCAGCCAAT TCATAAAAAA GCTCCAACTT 300 

25 TTGATGAGTT GTCTACCTCT TCTGAAATCC TTGAAACAGG GATCAAGGTT 350 

ATTGACCTTC TTGCCCCTTA CCTTAAAGGT GGTAAAGTTG GACTTTTCGG 400 

TGGTGCCGGA GTTGGTAAAA CCGTCTTAAT CCAAGAATTG ATTCACAACA 450 

TTGCCCAAGA GCACGGTGGT ATTTCAGTAT TTACTGGTGT TGGGGAACGT 50 0 

ACTCGTGAGG GGAATGACCT TTACTGGGAA ATGAAAGAAT CAGGCGTTAT 550 

3 0 CGAGAAAACA GCCATGGTCT TTGGTCAGAT GAATGAGCCA CCAGGAGCAC 60 0 

GTATGCGTGT TGCCCTTACT GGTTTGACAA TCGCTGAATA CTTCCGTGAT 650 

GTGGAAGGCC AAGACGTGCT TCTCTTTATC GATAATATCT TCCGTTTCAC 700 

TCAGGCTGGT TCAGAAGTAT CTGCCCTTTT GGGTCGTATG CCATCAGCCG 750 

TTGGTTACCA ACCAACACTT G C TACGG AAA TGGGTCAATT GCAAGAACGT 800 

35 ATCACATCAA CTAAGAAGGG TTC 823 



2) INFORMATION FOR SEQ ID NO: 954 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) .ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 954 

55 GCAGCAGGGG AAAAACTTCC TGAGATTAAC AATGCACTTG TCGTCTACAA 50 

AAATGACGAA AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTAG 100 

GAGATGGTAT GGTTCGTACT ATCGCCATGG AATCAACAGA TGGGTTGACT 150 

CGTGGAATGG AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG 200 

TAAAGAAACT TTGGGACGTG TCTTCAACGT TTTGGGAGAT ACCATTGACT 2 50 

60 TGGAAGCTCC TTTTACAGAA GACGCAGAGC GTCAGCCAAT TCATAAAAAA 3 00 
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GCTCCAACTT 
GATCAAGGTT 
GACTTTTCGG 
ATTCACAACA 
TGGGGAACGT 
CAGGCGTTAT 
CCAGGAGCAC 
CTTCCGTGAT 
TCCGTTTCAC 
CCATCAGCCG 
GCAAGAACGT 



TTGATGAGTT 
ATTGACCTTC 
TGGTGCCGGA 
TTGCCCAAGA 
ACTCGTGAGG 
CGAGAAAACA 
GTATGCGTGT 
GTGGAAGGCC 
TCAGGCTGGT 
TTGGTTACCA 
ATCACATCAA 



GTCTACCTCT 
TTGCCCCTTA 
GTTGGTAAAA 
GCACGGTGGT 
GGAATGACCT 
GCCATGGTCT 
TGCCCTTACT 
AAGACGTGCT 
TCAGAAGTAT 
ACCAACACTT 
CCAAGAAGGG 



TCTGAAATCC TTGAAACAGG 
CCTTAAAGGT GGTAAAGTTG 
CTGTCTTAAT CCAAGAATTG 
ATTTCAGTAT TTGCTGGTGT 
TTACTGGGAA ATGAAAGAAT 
TTGGTCAGAT GAATGAGCCA 
GGTTTGACAA TCGCTGAATA 
TCTCTTTATC GATAATATCT 
CTGCCCTTTT GGGTCGTATG 
GCTACGGAAA TGGGTCAATT 
TTCTGTAACC TCTA 



350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
844 
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2) INFORMATION FOR SEQ ID NO: 955 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 955 

AAAAACTTCC TGAGATTAAC AATGCACTTG TCGTCTACAA AAATGACGAA 50 

AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTAG GAGATGGTAT 100 

GGTTCGTACT ATCGCCATGG AATCAACAGA TGGGTTGACT CGTGGAATGG 150 

AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG TAAAGAAACT 2 00 

TTGGGACGTG TCTTCAACGT TTTGGGAGAT ACCATTGATT TGGAAGCTCC 250 

TTTTACAGAA GACGCAGAGC GTCAGCCAAT TCATAAAAAA GCTCCAACTT 300 

TTGATGAGTT GTCTACCTCT TCTGAAATCC TTGAAACAGG GATCAAGGTT 3 50 

ATTGACCTTC TTGCCCCTTA CCTTAAAGGT GGTAAAGTTG GACTTTTCGG 400 

TGGTGCCGGA GTTGGTAAAA CCGTCTTAAT CCAAGAATTG ATTCACAACA 450 

TTGCCCAAGA GCACGGTGGT ATTTCAGTAT TTACTGGTGT TGGGGAACGT 500 

ACTCGTGAGG GGAATGACCT TTACTGGGAA ATGAAAGAAT CAGGCGTTAT 550 

CGAGAAAACA GCCATGGTCT TTGGTCAGAT GAATGAGCCA CCAGGAGCAC 600 

GTATGCGTGT TGCCCTTACT GGTTTGACAA TCGCTGAATA CTTCCGTGAT 650 

GTGGAAGGCC AAGACGTGCT TCTCTTTATC GATAATATCT TCCGTTTCAC 700 

TCAGGCTGGT TCAGAAGTAT CTGCCCTTTT GGGTCGTATG CCATCAGCCG 750 

TTGGTTACCA ACCAACACTT GCTACGGAAA TGGGTCAATT GCAAGAACGT 800 

ATCACATCAA CCAAGAAGGG TTCTGTAACC TCTA 834 



2) INFORMATION FOR SEQ ID NO: 956 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 495 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia microti 

(B) STRAIN: Persing-1 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 956 

TTGTATATCA CAGGCACTCA GCAAATATTC CGATACTGAC GTAATTATAT 50 

ACGTGGGTTG TGGTGAACGT GGGAATGAAA TGGCTGAGAT TCTATGCGAA 100 

TTCCCTGAAC TATCTACTGT AGTTAATGAT GAAAAGGTGG CCATTATGGA 150 

10 ACGTACATGC TTAGTTGCCA ATACTTCTAA TATGCCAGTG GCCGCTAGAG 2 00 

AAGCTAGTAT ATACACTGGT ATTACAATTG CTGAATATTT CCGTGATATG 250 

GGTTACAACT GCACTCTTAT GGCCGATTCC ACTAGCCGAT GGGCAGAGGC 3 00 

TCTAAGGGAA ATTTCTGGTA GATTGGCTGA AATGCCTGCA GATTCTGGCT 3 50 

ATCCGGCCTA TTTATCGTCA AGGTTGTCAG CTTTTTATGA ACGTGCAGGT 400 

15 GGGATAACTG TCTAATTAAT TTAGGCTTGA TTAAGTGCTT AGGTTCACCA 450 

ACACGAACCG GATCTATTAC GGTTGTAGGA GCAGTTTCTC CACCA 4 95 



20 2) INFORMATION FOR SEQ ID NO: 957 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Entamoeba histolytica 

(B) STRAIN: HM1-1MSS 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 957 



AGTTATTTCA CAAGCATTAA GTAAATATAG TAATTCAGAT GTTATTATTT 50 

ATGTAGGATG TGGTGAACGA GGAAATGAAA TGGCAGAAGT TCTTCGAGAT 100 

TTTCCAGCTC TTTCTATTAA AGTAGGAGAT AAAGAAGAAT CTATTATGAC 150 

AAGAACAGCA CTTGTTGCTA ATACATCTAA TATGCCTGTT GCAGCACGTG 200 

40 AAGCATCAAT TTATACTGGA ATTACATTAT CAGAATATTA TAGAGATATG 250 

GGATATAATG TTGCTATGAT GGCAGATTCA ACATCAAGAT GGGCTGAAGC 300 

ACTTAGAGAA ATTTCAGGAC GTCTTGCAGA AATGCCAGCT GATTCTGGAT 350 

ATCCAGCATA TCTTGCAGCA CGTTTAGCAT CATTTTATGA ACGTGCAGGT 400 

ATGGTTGAAT GTTTAGGATC ACCAAAAAGA ATAGGGTCAG TTTCTATTGT 450 

45 AGGAGCTGTT TCACCACCT 4 69 



2) INFORMATION FOR SEQ ID NO: 958 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 452 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: FusoJbacterium nuclea turn subsp. polymorphum 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 958 

5 TACAACACCA ACTTGCTAAA TGGGCAGATG CAGAAGTAGT TGTTTATGTT 50 
GGTTGTGGGG AACGTGGAAA TGAAATGACC GATGTACTTA TGGAATTCCC * 100 

AGAAATTATT GACCCTAAGA CAGGACAATC TTTAATGAAG AGAACAGTTC 150 

TTATAGCTAA TACTTCTAAT ATGCCAGTTG CTGCTCGTGA GGCTTCAATC 200 

TATACTGGTA TAACTATTGC AGAATATTTT AGAGATATGG GATATTCAGT 250 

10 GGCACTTATG GCAGATTCAA CAAGTCGTTG GGCAGAAGCA CTTCGTGAAA 300 

TGTCAGGACG TTTGGAAGAA ATGCCAGGTG ATGAAGGATA TCCAGCATAT 350 

CTATCAAGTA GAATAGCAGA GTTTTATGAA AGAGCAGGGC TTGTTGAATG 4 00 

TCTAGGTAAT GGAGAAGAAG GAGCATTAAC TGTAATTGGA GCAGTATCTC 450 

CA 452 

15 



2) INFORMATION FOR SEQ ID NO: 959 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



45 



60 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania aethiopica 
30 (B) STRAIN: ATCC 50119 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 959 

TGTAATTAGT CAGGCCCTCT CCAAGTACTC CAACTCCGAC TGCGTCATCT 50 

35 ATGTCGGCTG CGGCGAGCGC GGTAATGAGA TGGCCGAGGT GCTCATGGAG 100 

TTCCCGACCC TGACGACCGT GATCAATGGT CGCGAGGAGT CGATCATGAA 150 

GCGCACCTGC CTCGTGGCGA ACACTTCGAA CATGCCAGTC GCAGCCCGTG 2 00 

AGGCCTCTAT TTACACCGGC ATCACCCTGG CCGAGTACTA CCGTGATATG 2 50 

GGCAAGCATA TCGCCATGAT GGCCGACTCG ACGTCTCGCT GGGCCGAGGC 3 00 

40 GCTTCGTGAG ATTTCGGGTC GTCTGGCGGA GATGCCGGCC GATGGTGGCT 3 50 

ACCCTGCCTA TCTCAGCGCT CGTCTCGCCT CCTTCTACGA GCGCGCCGGC 4 00 

CTCGTCACCT GCATCGGCGG GCCGAAGCGC CAGGGCTCCG TCACGATTGT 450 

CGGTGCCGTG TCTCCGCCG 4 69 



2) INFORMATION FOR SEQ ID NO: 960 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 4 69 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANI SM : Lei shmania tropi ca 

(B) STRAIN: ATCC 30815 
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(xi) SEQUENCE DESCRIPTION: SEQ 

TGTAATTAGT CAGGCCCTCT CCAAGTACTC 
ATGTCGGCTG CGGCGAGCGC GGTAATGAGA 
5 TTCCCGACCC TGACGACCGT GATCGATGGT 
GCGCACCTGC CTCGTGGCGA ACACTTCGAA 
AGGCCTCTAT TTACACCGGC ATCACCCTGG 
GGCAAGCATA TCGCCATGAT GGCCGACTCG 
GCTTCGTGAG ATTTCGGGTC GTTTGGCGGA 
10 ACCCTGCCTA TCTCAGCGCT CGTCTCGCCT 
CTCGTCACCT GCATCGGCGG GCCGAAGCGC 
CGGTGCCGTG TCTCCGCCG 



PCT/CA00/01150 

ID NO: 960 

CAACTCCGAC TGCGTCATCT . 50 
TGGCCGAGGT GCTCATGGAG 100 
CGCGAGGAGT CGATCATGAA 150 
CATGCCAGTC GCAGCCCGTG 2 00 

CCGAGTACTA CCGTGATATG 250 
ACGTCTCGCT GGGCCGAGGC 300 
GATGCCGGCC GATGGTGGCT 350 
CCTTCTACGA GCGCGCCGGC 400 
CAGGGCTCCG TCACGATTGT 450 

469 



15 



25 



2) INFORMATION FOR SEQ ID NO: 961 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 469 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania guyanensis 

(B) STRAIN: ATCC 50126 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 961 

TGTCATCAGT CAGGCCCTCT CCAAGTACTC CAACTCGGAC TGTGTCATCT 50 

ATGTCGGCTG CGGTGAACGC GGTAACGAGA TGGCCGAGGT GCTCATGGAG 100 

TTCCCGACCC TGACGACTGT GATCGATGGT CGCGAAGAGT CCATCATGAA 150 

35 GCGCACCTGC CTCGTGGCGA ACACCTCGAA CATGCCCGTC GCAGCCCGTG 2 00 

AGGCCTCTAT TTATACCGGC ATCACCCTTG CTGAGTACTA CCGTGATATG 250 

GGCAAGCACA TTGCCATGAT GGCCGACTCG ACGTCTCGCT GGGCCGAGGC 300 

GCTGCGTGAG ATTTCGGGTC GATTGGCGGA GATGCCGGCT GATGGTGGCT 350 

ACCCTGCCTA CCTCAGCGCC CGCCTCGCCT CCTTCTACGA GCGCGCCGGT 4 00 

40 CTCGTCACCT GCATCGGCGG GCCGAAGCGC CAGGGCTCCG TCACGATCGT 450 

CGGTGCAGTG TCTCCACCG 469 



45 2) INFORMATION FOR SEQ ID NO: 962 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania donovani 

(B) STRAIN: ATCC 50212 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 962 
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TGTCATTAGT CAGGCCCTCT 
ATGTCGGCTG CGGCGAGCGC 
TTCCCGACCC TGACGACCGT 
GCGCACCTGC CTCGTGGCGA 
AGGCCTCTAT TTACACCGGC 
GGCAAGCATA TCGCCATGAT 
GCTTCGTGAG ATTTCGGGTC 
ACCCCGCCTA CCTCAGCGCT 
CTCGTCACCT GCATCGGCGG 
CGGTGCCGTG TCTCCACCG 



CCAAGTACTC 
GGTAATGAGA 
GATCGATGGC 
ACACCTCGAA 
ATCACCCTGG 
GGCTGACTCG 
GTCTGGCGGA 
CGTCTCGCCT 
GCCGAAGCGC 



CAACTCCGAT TGCGTCATCT 
TGGCCGAGGT GCTCATGGAG 
CGCGAGGAGT CGATCATGAA 
CATGCCAGTC GCAGCCCGTG 
CCGAGTACTA CCGTGATATG 
ACGTCTCGCT GGGCCGAGGC 
GATGCCGGCC GATGGTGGCT 
CCTTCTACGA GCGCGCCGGC 
CAGGGCTCCG TCACGATCGT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
469 



2) INFORMATION FOR SEQ ID NO: 963 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania hertigi 

(B) STRAIN: ATCC 50125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 963 

TGTGATCAGT CAGGCCCTTT CCAAGTACTC CAACTCGGAC TGCGTCATCT 50 

ACGTCGGCTG TGGTGAGCGC GGGAACGAGA TGGCCGAGGT GCTCATGGAT 100 

TTCCCGACTT TGACGACTGT GATCGATGGT CGCGAGGAGT CCATCATGAA 150 

GCGCACCTGC CTCGTGGCGA ACACCTCCAA CATGCCAGTT GCAGCCCGTG 200 

AGGCTTCTAT CTATACCGGC ATCACACTGG CTGAGTACTA TCGTGATATG 250 

GGCAAGCACA TTGCCATGAT GGCCGACTCG ACGTCTCGCT GGGCCGAGGC 300 

GTTGCGTGAG ATTTCCGGTC GGCTGGCGGA GATGCCGGCC GATGGTGGTT 350 

ACCCCGCCTA CCTCAGTGCC CGTCTCGCCT CCTTCTACGA GCGCGCTGGC 400 

CTCGTGACCT GTATCGGCGG GCCGAAGCGC CAGGGCTCCG TCACAATTGT 450 

TGGTGCGGTG TCTCCACCG 469 



2) INFORMATION FOR SEQ ID NO: 964 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania mexicana 

(B) STRAIN: ATCC 50156 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 964 

GGTCATTAGT CAGGCCCTTT CCAAGTACTC CAACTCTGAC TGCGTCATCT 50 
ACGTCGGCTG CGGCGAGCGC GGTAATGAGA TGGCCGAGGT GCTCATGGAG 100 
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TTCCCGACCC TGACGACCAT 
GCGCACCTGC CTCGTGGCGA 
AGGCCTCTAT CTACACCGGC 
GGCAAGCATA TCGCCATGAT 
GCTTCGTGAG ATTTCGGGTC 
ACCCCGCCTA CCTCAGCGCT 
CTCGTCACCT GCATCGGCGG 
CGGTGCCGTG TCTCCGCCG 



GATCGATGGT 
ACACCTCGAA 
ATCACCCTCG 
GGCCGACTCG 
GTCTGGCGGA 
CGTCTCGCCT 
GCCGAAGCGC 



CGGGAGGAGT CGATCATGAA 
CATGCCCGTC GCAGCCCGTG 
CCGAGTACTA CCGTGATATG 
ACGTCTCGCT GGGCTGAGGC 
GATGCCGGCC GATGGTGGCT 
CCTTCTACGA GCGCGCCGGC 
CAGGGCTCCG TCACGATCGT 



150 
200 
250 
300 
350 
400 
450 
469 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 965 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 50129 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 965 

TGTAATTAGT CAGGCCCTCT CCAAGTACTC CAACTCCGAC TGCGTCATCT 50 

ATGTCGGCTG CGGCGAGCGC GGTAATGAGA TGGCCGAGGT GCTCATGGAG 100 

TTCCCGACCC TGACGACCGT GATCGATGGT CGCGAGGAGT CGATCATGAA 150 

GCGCACCTGC CTCGTGGCGA ACACTTCGAA CATGCCAGTC GCAGCCCGTG 2 00 

AGGCCTCTAT TTACACCGGC ATCACCCTGG CCGAGTACTA CCGTGATATG 250 

GGCAAGCATA TCGCCATGAT GGCCGACTCG ACGTCTCGCT ' GGGCCGAGGC 300 

GCTTCGTGAG ATTTCGGGTC GTTTGGCGGA GATGCCGGCC GATGGTGGCT 350 

ACCCTGCCTA TCTCAGCGCT CGTCTCGCCT CCTTCTACGA GCGCGCCGGC 400 

CTCGTCACCT GCATCGGCGG GCCGAAGCGC CAGGGCTCCG TCACGATTGT 450 

CGGTGCCGTG TCTCCGCCG 469 



2) INFORMATION FOR SEQ ID NO: 966 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 449 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Peptostreptococcus anaerobius 

(B) STRAIN: ATCC 27337 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 966 

CACCAGTTCG CCAAGTGGGC AGATGCTCAG ATAGTTGTAT ACGTTGGTTG 50 

TGGAGAACGT GGTAACGAGA TGACAGACGT TCTAAATGAA TTCCCAGAAC 100 

TGATTGACCC TCATACAGGC GAATCTCTAA TGAAGAGAAC AGTTCTTATA 150 

GCTAATACGT CAAATATGCC AGTTGCAGCC AGAGAGGCAA GTATATATAC 2 00 
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AGGTATTACA ATAGCTGAAT ATTTTAGAGA 
TAATGGCCGA CTCTACATCA AGATGGGCAG 
GGTCGTCTAG AAGAAATGCC TGGTGATGAA 
TTCTAGAGCT GCAGAGTTCT ATGAAAGAGC 
GTAGCGATAA TAGAGAGGGA GCCCTTACAA 



CATGGGATAT TCAGTAGCGG 
AGGCCCTAAG AGAGATGTCA 
GGTTATCCAG CCTATCTAGG 
AGGTAAGGTA ATATGTAAGG 
TAATAGGTGC CGTGTCACC 



250 
300 
350 
400 
449 



2) INFORMATION FOR SEQ ID NO: 967 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bordetella pertussis 

(B) STRAIN: ATCC 9797 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 967 

CTGGTGGTGT CGGCCGCAGA CGGCCCGATG CCGCAGACGC GCGAGCACAT 50 

TTTGCTGTCG CGCCAGGTTG GCGTGCCGTA CATCATCGTG TTCCTGAACA 100 

AGGCGGACAT GGTTGATGAC GCGGAGCTGC TCGAGCTGGT GAAGATGGAA 150 

GTCCGCGAAC TGCTGAGCAA GTACGATTTC CCGGGCGATG ACACGCCGAT 200 

CGTGAAGGGT TCGGCCAAGC TGGCGCTGGA AGGCGACAAG GGCGAACTGG 2 50 

GCGAGCAGGC GATTCTGTCG CTGGCGCAAG CGCTGGACAC GTACATTCCG 3 00 

ACGCCGGAGC GCGCGGTCGA CGGTGCGTTC CTGATGCCGG TGGAAGACGT 3 50 

GTTCTCGATC TCGGGCCGTG GCACGGTGGT GACTGGCCGT ATCGAGCGCG 4 00 

GCGTGGTGAA GGTTGGCGAG GAAAT CGAAA TCGTGGGCAT CAAGCCGACG 450 
GTGAAGACGA CCTGCACGGG CGTGGAGATG TTCCGCAAGC TGCTGGACCA . 500 

GGGCCAGGCG GGCGACAACG TGGGTATCTT GCTGCGCGGC ACCAAGCGTG 550 

AAGACGTCGA GCGTGGCCAG GTGCTGGCCA AGCCGGGTTC GATCAACCCG 600 

CACACGGACT TCACGGCCGA GGTGTACATT CTGTCCAAGG AAGAGGGTGG 650 

CCGTCACACG CCGTTCTTCA ACGGCTATCG TCCGCAGTTC TACTTCCGCA 700 

CGACGGACGT GACCGGCACG ATCGACCTGC CGGCGGACAA GGAAATGGTG 750 

CTGCCGGGCG ACAACGTGTC GATGACCGTC AAGCTGCTGG CCCCGATCGC 800 

CATGGAAGAA GGTCTGCGTT TCGCCA 826 



2) INFORMATION FOR SEQ ID NO: 968 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bordetella pertussis 

(B) STRAIN: BD180 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 968 
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CGATCCTGGT 
CACATTTTGC 
GAACAAGGCG 
TGGAAGTCCG 
CCGATCGTGA 
ACTGGGCGAG 
TTCCGACGCC 
GACGTGTTCT 
GCGCGGCGTG 
CGACGGTGAA 
GACCAGGGCC 
GCGTGAAGAC 
ACCCGCACAC 
GGTGGCCGTC 
CCGCACGACG 
TGGTGCTGCC 
ATCGCCATGG 



GGTGTCGGCC 
TGTCGCGCCA 
GACATGGTTG 
CGAACTGCTG 
AGGGTTCGGC 
CAGGCGATTC 
GGAGCGCGCG 
CGATCTCGGG 
GTGAAGGTTG 
GACGACCTGC 
AGGCGGGCGA 
GTCGAGCGTG 
GGACTTCACG 
ACACGCCGTT 
GACGTGACCG 
GGGCGACAAC 
AAGAAGG 



GCAGACGGCC 
GGTTGGCGTG 
ATGACGCGGA 
AGCAAGTACG 
CAAGCTGGCG 
TGTCGCTGGC 
GTCGACGGTG 
CCGTGGCACG 
GCGAGGAAAT 
ACGGGCGTGG 
CAACGTGGGT 
GCCAGGTGCT 
GCCGAGGTGT 
CTTCAACGGC 
GCACGATCGA 
GTGTCGATGA 



CGATGCCGCA GACGCGCGAG 
CCGTACATCA TCGTGTTCCT 
GCTGCTCGAG CTGGTGGAGA 
ATTTCCCGGG CGATGACACG 
CTGGAAGGCG ACAAGGGCGA 
GCAAGCGCTG GACACGTACA 
CGTTCCTGAT GCCGGTGGAA 
GTGGTGACTG GCCGTATCGA 
CGAAATCGTG GGCATCAAGC 
AGATGTTCCG CAAGCTGCTG 
ATCTTGCTGC GCGGCACCAA 
GGCCAAGCCG GGTTCGATCA 
ACATTCTGTC CAAGGAAGAG 
TATCGTCCGC AGTTCTACTT 
CCTGCCGGCG GACAAGGAAA 
CCGTCAAGCT GCTGGCCCCG 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
817 



2) INFORMATION FOR SEQ ID NO: 969 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 7 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: JSnterococcus columbae 

(B) STRAIN: ATCC 51263 - 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 969 

CCTATGCCAC AAACTCGTGA ACACATTCTT TTATCACGTA ACGTTGGTGT 50 

GCCATACATC GTTGTTTTCT TAAACAAAGT TGATATGGTT GACGACGAAG 100 

AATTATTAGA ATTAGTTGAA ATGGAAGTTC GTGACTTATT AACTGAATAT 150 

GACTTCCCAG GAGACGATGT TCCTGTAATC GCTGGTTCTG CATTAAAAGC 2 00 

TTTAGAAGGC GACCCTGCTT ACGAAGAAAA AATCTTAGAA TTAATGGCTG 2 50 

CAGTTGACGA ATACATCCCA ACTCCAGAAC GTGACAACGA CAAACCATTC 300 

ATGATGCCAG TTGAAGACGT GTTCTCAATT ACTGGTCGTG GTACTGTTGC 350 

TACAGGTCGT GTTGAACGTG GACAAGTTCG TGTTGGTGAC GAAGTTGAAA 400 

TCGTTGGTAT CGCTGACGAA ACTTCTAAAA CAACAGTTAC TGGTGTTGAA 450 

ATGTTCCGTA AATTATTAGA TTACGCTGAA GCTGGAGACA ACATCGGTGC 500 

ATTATTACGT GGTGTGGCTC GTGAAGACAT CCAACGTGGT CAAGTATTAG 550 

CTAAACCAGG TTCAATCACT CCACATACAA AATTCACTGC TGAAGTGTAC 600 

GTTTTAACTA AAGAAGAAGG TGGACGTCAT ACTCCAT 637 



2) INFORMATION FOR SEQ ID NO: 970 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 634 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 
5 (B) STRAIN: ATCC 49997 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 970 

TATGCCTCAA ACACGTGAAC ACATCTTGTT ATCACGTAAC GTTGGTGTAC 50 

10 CATACATCGT TGTTTTCTTA AACAAAATGG ATATGGTTGA TGACGAAGAA 100 

TTACTAGAAT TAGTTGAAAT GGAAGTTCGT GACTTATTGT CAGAATATGA 150 

CTTCCCAGGC GACGATGTTC CTGTAATCGC TGGTTCTGCT TTGAAAGCTC 200 

TTGAAGGCGA TGCTTCATAC GAAGAAAAAA TCATGGAATT AATGGCTGCA 250 

GTTGACGAAT ACGTTCCAAC TCCAGAACGT GACACTGACA AACCATTCAT 3 00 

15 GATGCCAGTC GAAGACGTAT TCTCAATCAC TGGACGTGGT ACTGTTGCTA 350 

CAGGCCGTGT TGAACGTGGA CAAGTTCGCG TTGGTGACGA AGTTGAAATC 400 

GTTGGTATTG CTGAAGAAAC TGCTAAAACA ACTGTAACTG GTGTTGAAAT 450 

GTTCCGTAAA TTGTTAGACT ATGCTGAAGC AGGGGATAAC ATTGGTGCAT 500 

TGCTACGTGG GGTTGCTCGT GAAGACATCC AACGTGGACA AGTATTAGCT 550 

2 0 AAAGCTGGTA CAATCACACC TCATACAAAA TTTAAAGCTG AAGTTTACGT 600 

TTTAACAAAA GAAGAAGGTG GACGTCACAC TCCA 634 



25 2) INFORMATION FOR SEQ ID NO: 971 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-55 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 971 



GGACCAATGC CACAAACTCG TGAGCACATC CTTCTTTCAC GTCAGGTTGG 50 

TGTTAAACAC CTTATCGTCT TCATGAACAA AGTTGACTTG GTTGACGACG 100 

AAGAATTGCT TGAATTGGTT GAAATGGAAA TCCGTGACCT ATTGTCAGAA 150 

TACGACTTCC CAGGTGACGA TCTTCCAGTT ATCCAAGGTT CAGCACTTAA 200 

45 AGCTCTTGAA GGTGACTCTA AATACGAAGA CATCGTTATG GAATTGATGA 250 

ACACAGTTGA TGAGTATATC CCAGAACCAG AACGTGACAC TGACAAACCA 300 

TTGCTTCTTC CAGTCGAGGA CGTATTCTCA ATCACTGGAC GTGGTACAGT 350 

TGCTTCAGGA CGTATCGACC GTGGTATCGT TAAAGTCAAC GACGAAATCG 400 

AAATCGTTGG TATCAAAGAA GAAACTCAAA AAGCAGTTGT TACTGGTGTT 450 

50 GAAATGTTCC GTAAACAACT TGACGAAGGT CTTGCTGGAG ATAACGTAGG 500 

TGTCCTTCTT CGTGGTGTTC AACGTGATGA AATCGAACGT GGACAAGTTA 550 

TCGCTAAACC AGGTTCAATC AACCCACACA CTAAATTCAA AGGTGAAGTC 600 

TACATCCTTA CTAAAGAAGA AGGTGGACGT CACACTCCAT TCTTCAACAA 650 

CTACCGTCCA CAATTCTACT TCCGTACTAC TGACGTTACA GGTTCAATCG 700 

55 AACTTCCAGC AGGTACTGAA ATGGTAATGC CTGGTGATAA CGTGACAATC 750 

GACGTTGAGT TGATTCACCC AATCGCCGTA GAACAAG 787 



60 2) INFORMATION FOR SEQ ID NO: 972 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 35401 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 972 



TAGTTGCTGC GACTGACGGC CCGATGCCGC 
CTGGGTCGTC AGGTAGGCGT TCCGTACATC 
CGACATGGTT GATGACGAAG AGCTGCTGGA 
GTGAACTTCT GTCTCAGTAC GACTTCCCGG 
CGTGGTTCTG CTCTGAAAGC GCTGGAAGGC 
AATCCTGGAA CTGGCTGGCT TCCTGGATTC 
GTGCGATTGA CAAGCCGTTC CTGCTGCCGA 
TCCGGTCGTG GTACCGTTGT TACCGGTCGT 
AGTTGGTGAA GAAGTTGAAA TCGTTGGTAT 
CCTGTACTGG CGTTGAAATG TTCCGCAAAC 
GGTGAGAACG TAGGTGTTCT GCTGCGTGGT 
ACGTGGTCAG GTACTGGCTA AGCCGGGCAC 
TCGAATCTGA AGTGTACATT CTGTCCAAAG 
CCGTTCTTCA AAGGCTACCG TCCGCAGTTC 
GACTGGTACC ATCGAACTGC CGGAAGGCGT 
ACAACATCAA AATGGTTGTT ACCCTGATCC 
GGT 



AGACTCGTGA GCACATCCTG 50 

ATCGTGTTCC TGAACAAATG 100 

ACTGGTTGAA ATGGAAGTTC 150 

GCGACGACAC TCCGATCGTT 2 00 

GACGCAGAGT GGGAAGCGAA 250 

TTACATTCCG GAACCAGAGC 300 

TCGAAGACGT ATTCTCCATC 3 50 

GTAGAACGCG GTATCATCAA 400 

CAAAGAGACT CAGAAGTCTA 450 

TGCTGGACGA AGGCCGTGCT 500 

ATCAAACGTG AAGAAATCGA 550 

CATCAAGCCG CACACCAAGT 600 

ATGAAGGCGG CCGTCATACT 650 

TACTTCCGTA CTACTGACGT 700 

AGAGATGGTA ATGCCGGGCG 750 

ACCCGATCGC GATGGACGAC 800 

803 



2) INFORMATION FOR SEQ ID NO: 973 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 762 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 43895 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 973 

ACTGACGGCC CGATGCCGCA GACTCGTGAG CACATCCTGC TGGGTCGTCA 50 

GGTAGGCGTT CCGTACATCA TCGTGTTCCT GAACAAATGC GACATGGTTG 100 

ATGACGAAGA GCTGCTGGAA CTGGTTGAAA TGGAAGTTCG TGAACTTCTG 150 

TCTCAGTACG ACTTCCCGGG CGACGACACT CCGATCGTTC GTGGTTCTGC 2 00 

TCTGAAAGCG CTGGAAGGCG ACGCAGAGTG GGAAGCGAAA ATCCTGGAAC 250 

TGGCTGGCTT CCTGGATTCC TACATTCCGG AACCAGAGCG TGCRATTGAC 3 00 

AAGCCGTTCC TGCTGCCGAT CGAAGACGTA TTCTCCATCT CCGGTCGTGG 350 

TACCGTTGTT ACCGGTCGTG TAGAACGCGG TATCATCAAA GTTGGTGAAG 4 00 

AAGTTGAAAT CGTTGGTATC AAAGAGACTC AGAAGTCTAC CTGTACTGGC 450 
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AGGTGTTCTG CTGCGTGGTA TCAAACGTGA 
TACTGGCTAA GCCGGGCACC ATCAAGCCGC 
GTGTACATTC TGTCCAAAGA TGAAGGCGGC 
5 AGGCTACCGT CCGCAGTTCT ACTTCCGTAC 
TTGAACTGCC GGAAGGCGTA GAGATGGTAA 
ATGGTTGTTA CC 
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GGCCGTGCTG GTGAGAACGT 500 

AGAAATCGAA CGTGGTCAGG 550 

ACACCAAGTT CGAATCTGAA 600 

CGTCATACTC CGTTCTTCAA 650 

TACTGACGTG ACTGGTACCA 700 

TGCCGGGCGA CAACATCAAA 750 

762 



10 

2) INFORMATION FOR SEQ ID NO: 974 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 804 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 11775 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 974 

CCTGGTAGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACT CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCCTGAAC 100 

AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TTGAAATGGA 150 

3 0 AGTTCGTGAA CTTCTGTCTC AGTACGACTT CCCGGGCGAC GACACTCCGA 2 00 

-TCGTTCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGACGC AGAGTGGGAA 2 50 

GCGAAAATCC TGGAACTGGC TGGCTTCCTG GATTCCTACA TTCCGGAACC 3 00 

AGAGCGTGCG ATTGACAAGC CGTTCCTGCT GCCGATCGAA GACGTATTCT 3 50 

CCATCTCCGG TCGTGGTACC GTTGTTACCG GTCGTGTAGA ACGCGGTATC 4 00 

35 ATCAAAGTTG GTGAAGAAGT TGAAATCGTT GGTATCAAAG AGACTCAGAA 4 50 

GTCTACCTGT ACTGGCGTTG AAATGTTCCG CAAACTGCTG GACGAAGGCC 500 

GTGCTGGTGA GAACGTAGGT GTTCTGCTGC GCGGTATCAA ACGTGAAGAA 550 

ATCGAACGTG GTCAGGTACT GGCTAAGCCG GGCACCATCA AGCCGCACAC 600 

CAAGTTCGAA TCTGAAGTGT ACATTCTGTC CAAAGATGAA GGCGGTCGTC 650 

40 ATACTCCGTT CTTCAAAGGC TACCGTCCGC AGTTCTACTT CCGTACTACT 700 

GACGTGACTG GTACCATCGA ACTGCCGGAA GGCGTAGAGA TGGTAATGCC 750 

GGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCG ATCGCGATGG 800 

ACGA 804 

45 

2) INFORMATION FOR SEQ ID NO: 975 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 804 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 25922 



60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 975 

GCGATCCTGG TAGTTGCTGC GACTGACGGC CCGATGCCGC AGACTCGTGA 50 

GCACATCCTG CTGGGTCGTC AGGTAGGCGT TCCGTACATC ATCGTGTTCC 100 

5 TGAACAAATG CGACATGGTT GATGACGAAG AGCTGCTGGA ACTGGTTGAA 150 

ATGGAAGTTC GTGAACTTCT GTCTCAGTAC GACTTCCCGG GCGACGACAC 200 

TCCGATCGTT CGTGGTTCTG CTCTGAAAGC GCTGGAAGGC GACGCAGAGT 250 

GGGAAGCGAA AATCCTGGAA CTGGCTGGCT TCCTGGATTC YTACATTCCG 300 

GAACCAGAGC GTGCGATTGA CAAGCCGTTC CTGCTGCCGA TCGAAGACGT 350 

10 ATTCTCCATC TCCGGTCGTG GTACCGTTGT TACCGGTCGT GTAGAACGCG 400 

GTATCATCAA AGTTGGTGAA GAAGTTGAAA TCGTTGGTAT CAAAGAGACT 450 

CAGAAGTCTA CCTGTACTGG CGTTGAAATG TTCCGCAAAC TGCTGGACGA 500 

AGGCCGTGCT GGTGAGAACG TAGGTGTTCT GCTGCGTGGT ATCAAACGTG 550 

AAGAAATCGA ACGTGGTCAG GTACTGGCTA AGCCGGGCAC CATCAAGCCG 600 

15 CACACCAAGT TCGAATCTGA AGTGTACATT CTGTCCAAAG ATGAAGGCGG 650 

TCGTCATACT CCGTTCTTCA AAGGCTACCG TCCGCAGTTC TACTTCCGTA 700 

CTACTGACGT GACTGGTACC ATCGAACTGC CGGAAGGTGT AGAGATGGTA 750 

ATGCCGGGCG AC AACAT CAA AATGGTTGTT ACCCTGATCC ACCCGATCGC 800 

GATG 804 

20 



2) INFORMATION FOR SEQ ID NO: 976 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium avium 
35 (B) STRAIN: Mavi-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 976 

GGCGCGATCC TGGTGGTCGC CGCCACCGAC GGCCCGATGC CGCAGACCCG 50 

40 TGAGCACGTG CTGCTCGCCC GTCAGGTCGG TGTGCCCTAC ATCCTGGTCG 100 

CCCTGAACAA GGCCGACATG GTCGACGACG AGGAGCTCCT CGAGCTCGTC 150 

GAGATGGAGG TCCGCGAGCT GCTGGCCGCC CAGGAGTTCG ACGAGGACGC 200 

CCCGGTGGTG CGGGTCTCGG CGCTCAAGGC GCTCGAGGGC GACGCCAAGT 2 50 

GGGTCGAGTC CGTCGAGCAG CTGATGGAGG CCGTCGACGA GTCGATCCCG 3 00 

45 GACCCGGTCC GCGAGACGGA GAAGCCGTTC CTGATGCCGG TGGAGGACGT 3 50 

CTTCACCATC ACCGGGCGTG GCACCGTGGT CACCGGTCGT GTCGAGCGCG 400 

GTGTGATCAA CGTGAACGAG GAAGTCGAGA TCGTCGGCAT CCGCCCGACC 450 

AGCACCAAGA CCACGGTCAC CGGTGTGGAR ATGTTCCGCA AGCTGCTCGA 500 

CCAGGGCCAG GCCGGTGACA ACGTCGGTCT GCTGCTGCGT GGTATCAAGC 550 

50 GTGAGGACGT CGAGCGCGGC CAGGTCGTGA CCAAGCCCGG CACCACCACG 600 

CCGCACACCG AGTTCGAGGG CCAGGTCTAC ATCCTGTCCA AGGACGAGGG 650 

CGGCCGGCAC ACGCCGTTCT TCAACAACTA CCGTCCGCAG TTCTACTTCC 700 

GCACCACCGA CGTGACCGGT GTGGTGACGC TGCCGGAGGG CACCGAGATG 750 

GTGATGCCCG GTGACAACAC CAACATCTCG GTGAAGCTGA TCCAGCCCGT 800 

55 CGCCATGGAC GACGGTSTGC GGTTC 825 



2) INFORMATION FOR SEQ ID NO: 977 

60 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 977 

TATCCTTGTA GTAGCTTCAA CTGACGGACC AATGCCACAA ACTCGTGAGC 50 

ACATCCTTCT TTCACGTCAG GTTGGTGTTA AACACCTTAT CGTCTTCATG 100 

AACAAAGTTG ACTTGGTTGA CGACGAAGAA TTGCTTGAAT TGGTTGAAAT 150 

GGAAATCCGT GACCTATTGT CAGAATACGA CTTCCCAGGT GACGATCTTC 2 00 

CAGTTATCCA AGGTTCAGCA CTTAAAGCTC TTGAAGGTGA CTCTAAATAC 250 

GAAGACATCG TTATGGAATT GATGAACACA GTTGATGAGT ATATCCCAGA 300 

ACCAGAACGT GACACT GAC A AACCATTGCT TCTTCCAGTC GAGGACGTAT 350 

TCTCAATCAC TGGACGTGGT ACAGTTGCTT CAGGACGTAT CGACCGTGGT 400 

ATCGTTAAAG TCAACGACGA AATCGAAATT GTTGGTATCA AAGAAGAAAC 450 

TCAAAAAGCA GTTGTTACTG GTGTTGAAAT GTTCCGTAAA CAACTTGACG 500 

AAGGTCTTGC TGGAGATAAC GTAGGTGTCC TTCTTCGTGG TGTTCAACGT 550 

GATGAAATCG AACGTGGACA AGTTATCGCT AAACCAGGTT CAATCAACCC 600 

ACACACTAAA TTCAAAGGTG AAGTCTACAT CCTTACTAAA GAAGAAGGTG 650 

GACGTCACAC TCCATTCTTC AACAACTACC GTCCACAATT CTACTTCCGT 70 0 

ACTACTGACG TTACAGGTTC AATCGAACTT CCAGCAGGTA CTGAAATGGT 750 

AATGCCTGGT GATAACGTGA CAATCGACGT TGAGTTGATT CACCCAATCG 800 

CCGTAGAACA AGGTACTACA 820 



2) INFORMATION FOR SEQ ID NO: 978 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium gordonae 

(B) STRAIN: M-Gor-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 978 

GGCGCGATCC TGGTGGTCGC CGCCACCGAT GGCCCGATGC CGCAGACCCG 50 

TGAGCACGTG CTGCTCGCCC GTCAGGTGGG CGTGCCCTAC ATCCTGGTGG 100 

CGCTGAACAA GTCCGACGCG GTCGACGACG AGGAGCTGCT CGAGCTCGTC 150 

GAGCTGGAGG TCCGCGAGTT GCTGGCCGCC CAGGACTTCG ACGAGGAAGC 200 

TCCGGTGGTC CGGGTCTCGG CGCTGAAGGC GCTCGAGGGC GACGCCACCT 2 50 

GGGTGAAGTC GGTAGAGGAC TTGATGGACG CGGTCGACGA GTCGATTCCG 3 00 

GACCCGGTCC GCGACACCGA CAAGCCGTTC CTGATGCCCG TCGAGGACGT 350 

CTTCACCATC ACCGGTCGTG GCACCGTCGT CACCGGCCGT GTGGAGCGCG 400 

GCGTGGTGAA CGTGAACGAG GAAGTCGAGA TCGTCGGCAT CAAGCCGACC 450 

AGCACCAAGA CCACGGTCAC CGGTGTGGAG ATGTTCCGCA AGCTGCTCGA 500 

544 
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CCAGGGTCAG GCCGGTGACA ACGTCGGTCT 
GTGAGGACGT CGAGCGCGGC CAGGTCGTCA 
CCGCACACCG AGTTCGAGGG TCAGGTCTAC 
CGGCCGGCAC ACGCCGTTCT TCAACAACTA 
GCACCACCGA CGTGACCGGT GTGGTGACGC 
GTGATGCCCG GTGACAACAC CAACATCTCG 
CGCCATGGAC GACGGTCTGC GG 



PCT/CA00/01150 

GCTGCTGCGT GGTGTCAAGC 550 

TCAAGCCCGG CACCACCACT 600 

ATCCTGTCCA AGGACGAGGG 650 

CCGTCCGCAG TTCTACTTCC 700 

TGCCGGAGGG CACCGAAATG 750 

GTGAAGCTGA TCCAGCCCGT 800 

822 



2) INFORMATION FOR SEQ ID NO: 979 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneu/noniae 

(B) STRAIN: StrR-11 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 979 



CTATCCTTGT AGTAGCTTCA ACTGACGGAC 
CACATCCTTC TTTCACGTCA GGTTGGTGTT 
GAACAAAGTT GACTTGGTTG ACGACGAAGA 
TGGAAATCCG TGACCTATTG TCAGAATACG 
CCAGTTATCC AAGGTTCAGC ACTTAAAGCT 
CGAAGAC AT C GTTATGGAAT TGATGAACAC 
AACCAGAACG TGACACTGAC AAACCATTGC 
TTCTCAATCA CTGGACGTGG TACAGTTGCT 
TATCGTTAAA GTCAACGACG AAATCGAAAT 
CTCAAAAAGC AGTTGTTACT GGTGTTGAAA 
GAAGGTCTTG CTGGAGATAA CGTAGGTGTC 
TGATGAAATC GAACGTGGAC AAGTTATCGC 
CACACACTAA ATTCAAAGGT GAAGTCTACA 
GGACGTCACA CTCCATTCTT CAACAACTAC 
TACTACTGAC GTTACAGGTT CAATCGAACT 
TAATGCCTGG TGATAACGTG ACAATCGACG 
GCCGTAGAAC AAGGTACTAC A 



CAATGCCACA AACTCGTGAG 50 

AAACACCTTA TCGTCTTCAT 100 

ATTGCTTGAA TTGGTTGAAA 150 

ACTTCCCAGG TGACGATCTT 200 

CTTGAAGGTG ACTCTAAATA 250 

AGTTGATGAG TATATCCCAG 300 

TTCTTCCAGT CGAGGACGTA 350 

TCAGGACGTA TCGACCGTGG 4 00 

CGTTGGTATC AAAGAAGAAA 450 

TGTTCCGTAA ACAACTTGAC 500 

CTTCTTCGTG GTGTTCAACG 550 

TAAACCAGGT TCAATCAACC 60 0 

TCCTTACTAA AGAAGAAGGT 650 

CGTCCACAAT TCTACTTCCG 700 

TCCAGCAGGT ACTGAAATGG 750 

TTGAGTTGAT TCACCCAATC 8 00 

821 



2) INFORMATION FOR SEQ ID NO: 980 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(B) STRAIN: ATCC 25177 



545 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 980 

GGTGCGATCC TGGTGGTCGC CGCCACCGAC GGCCCGATGC CCCAGACCCG 50 

CGAGCACGTT CTGCTGGCGC GTCAAGTGGG TGTGCCCTAC ATCCTGGTAG 100 

5 CGCTGAACAA GGCCGACGCA GTGGACGACG AGGAGCTGCT CGAACTCGTC 150 

GAGATGGAGG TCCGCGAGCT GCTGGCTGCC CAGGAATTCG ACGAGGACGC 200 

CCCGGTTGTG CGGGTCTCGG CGCTCAAGGC GCTCGAGGGT GACGCGAAGT 250 

GGGTTGCCTC TGTCGAGGAA CTGATGAACG CGGTCGACGA GTCGATTCCG 300 

GACCCGGTCC GCGAGACCGA CAAGCCGTTC CTGATGCCGG TCGAGGACGT 350 

10 CTTCACCATT ACCGGCCGCG GAACCGTGGT CACCGGACGT GTGGAGCGCG 400 

GCGTGATCAA CGTGAACGAG GAAGTTGAGA TCGTCGGCAT TCGCCCATCG 450 

ACCACCAAGA CCACCGTCAC CGGTGTGGAG ATGTTCCGCA AGCTGCTCGA 500 

CCAGGGCCAG GCGGGCGACA ACGTTGGTTT GCTGCTGCGG GGCGTCAAGC 550 

GCGAGGACGT CGAGCGTGGC CAGGTTGTCA CCAAGCCCGG CACCACCACG 600 

15 CCGCACACCG AGTTCGAAGG CCAGGTCTAC ATCCTGTCCA AGGACGAGGG 650 

CGGCCGGCAC ACGCCGTTCT TCAACAACTA CCGTCCGCAG TTCTACTTCC 700 

GCACCACCGA CGTGACCGGT GTGGTGACAC TGCCGGAGGG CACCGAGATG 750 

GTGATGCCCG GTGACAACAC CAACATCTCG GTGAAGTTGA TCCAGCCCGT 800 

CGCCATGGAC GAAGGTNTGC GTTTCGCG 82 8 

20 



2) INFORMATION FOR SEQ ID NO: 981 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 
35 (B) STRAIN: CSG 144 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 981 

CGGCGGTATC TTAGTAGTAT CTGCTGCAGA TGGCCCAATG CCACAAACTC 50 

40 GTGAACACAT CTTATTATCA CGTAACGTTG GTGTACCAGC TTTAGTTGTA 100 

TTCTTAAACA AAGCTGACAT GGTTGACGAC GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTATCTGA ATACGACTTC CCTGGTGACG 200 

ATGTACCAGT TATCGTTGGT TCTGCATTAA AAGCTTTAGA AGGCGACCCA 250 

GAATACGAAC AAAAAATCTT AGACTTAATG CAAGCTGTAG ATGACTACAT 3 00 

45 CCCAACTCCA GAACGTGACT CTGATAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TAGCAACAGG CCGTGTTGAA 400 

CGTGGTCAAA TCAAAGTCGG TGAAGAAGTT GAAATCATCG GTATCACTGA 450 

AGAAAGCAAG AAAACAACAG TTACAGGTGT AGAAATGTTC CGTAAATTAT 500 

TAGACTACGC TGAAGCTGGT GACAACATCG GTGCTTTATT ACGTGGTGTT 550 

50 GCACGTGAAG ACGTACAACG TGGACAAGTA TTAGCAGCTC CTGGCTCTAT 600 

TACTCCACAC ACAAAATTCA AAGCTGATGT TTACGTTTTA TCTAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCACTA ACTACCGCCC ACAATTCTAC 700 

TTCCGTACTA CTGACGTAAC TGGCGTTGTT CACTTACCAG AAGGTACTGA 750 

AATGGTTATG CCTGGCGATA ACGTAGAAAT GACTGTTGAA TTAATCGCTC 800 

55 CAATCGCGAT TGAAGACGG 819 



60 



2) INFORMATION FOR SEQ ID NO: 982 

546 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus witis 

(B) STRAIN: LSPQ 2583 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 982 

CTATCCTTGT AGTAGCTTCA ACTGACGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA GGTTGGTGTT AAACACCTTA TCGTCTTCAT 100 

GAACAAGATC GACTTGGTTG ATGACGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATCCG TGACCTCTTG TCAGAATACG ACTTCCCAGG TGACGATCTT 200 

CCAGTTATCC AAGGTTCAGC TCTTAAAGCT CTTGAAGGTG ATACTAAGTA 250 

CGAAGACATC ATCATGGAAT TGATGAACAC TGTTGATGAG TACATCCCAG 300 

AACCAGAACG TGATACTGAC AAACCTCTTC TTCTTCCAGT CGAAGACGTA 350 

TTCTCAATCA CTGGTCGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 400 

TACTGTTCGT GTCAACGATG AAATCGAAAT CGTTGGTATC AAAGAAGAAA 450 

TCCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAGCTTGAC 500 

GAAGGTCTTG CAGGGGACAA CGTAGGTGTA CTTCTTCGTG GTATCCAACG 550 

TGATGAAATC GAACGTGGTC AAGTTATCGC TAAACCAGGT TCAATCAACC 600 

CACACACTAA ATTCAAGGGT GAAGTTTACA TCCTTACTAA AGAAGAAGGT 650 

GGACGTCACA CTCCATTCTT CAACAACTAC CGTCCACAGT TCTACTTCCG 700 

TACAACTGAC GTTACAGGTT CAATCGAACT TCCAGCAGGT ACTGAAATGG 750 

TAATGCCTGG TGATAACGTA ACTATCGACG TTGAGTTGAT CCACCCAATC 800 

GCCGTTGAAC AAGG 814 



2) INFORMATION FOR SEQ ID NO: 983 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 4 9456 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 983 

CTATCCTTGT AGTAGCTTCA ACTGACGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA GGTTGGTGTT AAACACCTTA TCGTCTTCAT 100 

GAACAAAGTT GACTTGGTTG ACGACGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATCCG TGACCTATTG TCAGAATACG ACTTCCCAGG TGACGATCTT 200 

CCAGTTATCC AAGGTTCAGC TCTTAAAGCC CTTGAAGGTG ACACTAAATA 250 

CGAAGACATC GTTATGGAAT TGATGAACAC AGTTGATGAG TACATCCCAG 3 00 

AACCAGAACG TGACACTGAC AAACCATTGC TTCTTCCAGT CGAAGACGTA 350 

TTCTCAATCA CTGGTCGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 400 

TATCGTTAAA GTCAACGACG AAATCGAAAT CGTTGGTATC AAAGAAGAAA 450 

CTCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAACTTGAC 500 

547 
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GAAGGTCTTG CCGGAGATAA TGTAGGTGTC CTTCTTCGTG GTGTTCAACG 550 

TGATGAAATC GAACGTGGAC AAGTTATTGC TAAACCAGGT TCAATCAACC 600 

CACACACTAA ATTCAAAGGT GAAGTTTACA TCCTTACTAA AGAAGAAGGT 650 

GGACGTCACA CTCCATTCTT CAACAACTAC CGTCCACAAT TCTACTTCCG 700 

5 TACTACTGAC GTTACAGGTT CAATCGAACT TCCAGCAGGT ACTGAAATGG 750 

TAATGCCTGG TGATAACGTG ACAATCGACG TTGAGTTGAT CCACCCAATC 800 

GCCGTAGAAC 810 



10 



20 



45 



60 



2) INFORMATION FOR SEQ ID NO: 984 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 817 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 903 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 984 

TCCTTGTAGT AGCTTCAACT GACGGACCAA TGCCACAAAC TCGTGAGCAC 50 

ATCCTTCTTT CACGTCAGGT TGGTGTTAAA CACCTTATCG TCTTCATGAA 100 

CAAGATCGAC TTGGTTGATG ACGAAGAATT GCTTGAATTG GTTGAAATGG 150 

30 AAATCCGTGA CCTCTTGTCA GAATACGACT TCCCAGGTGA CGATCTTCCA 2 00 

GTTATCCAAG GTTCAGCTCT TAAAGCTCTT GAAGGTGATA CTAAGTACGA 250 

AGACATCATC ATGGAATTGA TGAACACTGT TGATGAGTAC ATCCCAGAAC 3 00 

CAGAACGTGA TACTGACAAA CCTCTTCTTC TTCCAGTCGA AGACGTATTC 3 50 

TCAATCACTG GTCGTGGTAC AGTTGCTTCA GGACGTATCG ACCGTGGTAC 4 00 

35 TGTTCGTGTC AACGATGAAA TCGAAATCGT TGGTATCAAA GAAGAAATCC 450 

AAAAAGCAGT TGTTACTGGT GTTGAAATGT TCCGTAAACA GCTTGACGAA 500 

GGTCTTGCAG GGGACAACGT AGGTGTACTT CTTCGTGGTA TCCAACGTGA 550 

TGAAATCGAA CGTGGTCAAG TTATCGCTAA ACCAGGTTCA ATCAACCCAC 600 

ACACTAAATT CAAGGGTGAA GTTTACATCC TTACTAAAGA AGAAGGTGGA 650 

4 0 CGTCACACTC CATTCTTCAA CAACTACCGT CCACAGTTCT ACTTCCGTAC 700 

AACTGACGTT ACAGGTTCAA TCGAACTTCC AGCAGGTACT GAAATGGTAA 750 

TGCCTGGTGA TAACGTAACT ATCGACGTTG AGTTGATCCA CCCAATCGCC 800 

GTTGAACAAG GTACTAC 817 



2) INFORMATION FOR SEQ ID NO: 985 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 7 98 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 
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GACGGACCAA TGCCACAAAC TCGTGAGCAC ATCCTTCTTT CACGTCAGGT 50 

TGGTGTTAAA CACCTTATTG TCTTCATGAA CAAAATTGAC TTGGTAGACG 100 

5 ACGAAGAATT GCTTGAATTG GTTGAAATGG AAATCCGTGA CCTCTTGTCA 150 

GAATACGACT TCCCAGGTGA CGATCTTCCA GTTATCCAAG GTTCAGCTCT 200 

TAAAGCTCTT GAAGGTGACT CTAAATACGA AGACATCATT ATGGAATTGA 250 

TGAACACTGT TGATGAGTAC ATCCCAGAAC CAGAACGTGA CACTGAAAAA 300 

CCATTGCTTC TTCCAGTCGA AGACGTATTC TCAATCACTG GACGTGGTAC 350 

10 AGTTGCTTCA GGACGTATCG ACCGTGGTAC TGTTCGTGTC AACGACGAAA 400 

TCGAAATCGT TGGTATCAAA GAAGAAACTC AAAAAGCAGT TGTTACTGGT 450 

GTTGAAATGT TCCGTAAACA ACTTGACGAA GGTCTTGCCG GAGATAACGT 500 

AGGTGTCCTT CTTCGTGGTG TTCAACGTGA CGAAATCGAA CGTGGACAAG 550 

TTATCGCTAA ACCAGGTTCA ATCAACCCAC ACACTAAATT TAAAGGTGAA 600 

15 GTCTACATCC TTACTAAAGA AGAAGGTGGA CGTCACACTC CATTCTTCAA 650 

CAACTACCGT CCACAATTCT ACTTCCGTAC TACTGACGTT ACAGGTTCAA 700 

TCGAACTTCC TGCAGGTACT GAAATGGTAA TGCCTGGTGA TAACGTGACT 750 

ATCGACGTTG AGTTGATCCA CCCAATCGCC GTAGAACAAG GTACTACA 798 



20 



35 



2) INFORMATION FOR SEQ ID NO: 986 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 815 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 986 



TTGTAGTAGC TTCAACTGAC GGACCAATGC CACAAACTCG TGAGCACATC 50 

CTTCTTTCAC GTCAGGTTGG TGTTAAACAC CTTATCGTCT TCATGAACAA 100 

40 AGTTGACTTG GTTGACGACG AAGAATTGCT TGAATTGGTT GAAATGGAAA 150 

TCCGTGACCT ATTGTCAGAA TACGACTTCC CAGGTGACGA TCTTCCAGTT 200 

ATCCAAGGTT CAGCACTTAA AGCTCTTGAA GGTGACTCTA AATACGAAGA 250 

CATCGTTATG GAATTGATGA ACACAGTTGA TGAGTATATT CCAGAACCAG 300 

AACGTGACAC TGACAAACCA TTGCTTCTTC CAGTCGAGGA CGTATTCTCA 3 50 

45 ATCACTGGAC GTGGTACAGT TGCTTCAGGA CGTATCGACC GTGGTATCGT 400 

TAAAGTCAAC GACGAAATCG AAATCGTTGG TATCAAAGAA GAAACTCAAA 450 

AAGCAGTTGT TACTGGTGTT GAAATGTTCC GTAAACAACT TGACGAAGGT 500 

CTTGCTGGAG ATAACGTAGG TGTCCTTCTT CGTGGTGTTC AACGTGATGA 550 

AATCGAACGT GGACAAGTTA TCGCTAAACC AGGTTCAATC AACCCACACA 600 

50 CTAAATTCAA AGGTGAAGTC TACATCCTTA CTAAAGAAGA AGGTGGACGT 650 

CACACTCCAT TCTTCAACAA CTACCGTCCA CAATTCTACT TCCGTACTAC 70 0 

TGACGTTACA GGTTCAATCG AACTTCCAGC AGGTACTGAA ATGGTAATGC 750 

CTGGTGATAA CGTGACAATC GACGTTGAGT TGATTCACCC AATCGCCGTA 800 

GAACAAGGTA CTACA 815 

55 



2) INFORMATION FOR SEQ ID NO: 987 

60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus hirae 
10 (B) STRAIN: ATCC 8043 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 987 

CGGACAATCT TGGTTGTTTC TGCTACGGAT GGCCCAATGC CTCAAACTCG 50 

15 TGAACACATT TTACTTTCTC GCCAAGTAGG CGTGAAATAT TTGATTGTTT 100 

TCTTGAACAA AACAGATTTA GTCGATGATG AAGAATTAAT TGATCTAGTA 150 

GAAATGGAAG TTCGTGAACT ATTAAGCGAA TATGGATTTC CAGGTGATGA 200 

TACACCTGTT ATCAAAGGAT CAGCATTAAA AGCATTACAA GGTGATCCTG 250 

ATGCAGAAGC AGCTATCATG GAATTGATGG ACACAGTCGA TGAATACATC 3 00 

20 CCAACACCAG AACGTGATAC GGACAAACCA TTATTGTTAC CCGCCGAAGA 350 

TGTATTTTCA ATTACTGGAC GTGGAACAGT AGCATCTGGT CGGATTGACC 400 

GTGGGGCTGT TAGAGTCGGT GACGAAATCG AAATCGTAGG GATCAAACCA 450 

GAAACGCAAA GAGCCGTAGT AACAGGAGTT GAAATGTTCC GCAAAACGCT 500 

TGATTACGGT GAAGCAGGGG ATAACGTAGG TGTGTTATTA CGTGGGATTC 550 

25 AAAGAGAAGA CATCGAACGT GGCCAAGTGA TTGCCAAACC TGGTTCAATT 600 

ACACCTCATA CTAAATTCAA AGCAGAAGTT TAGGTTTTGA CTAAAGAAGA 650 

AGGCGGACGT CATACACCAT TCTTCAATAA TTATCGACCA CAATTTTATT 700 

TCCGCACAAC TGACGTAACA GGAACAATTG TTTTGCCAGA AGGAACGGAA 750 

ATGGTCATGC CTGGCGACAA CGTAACGATC GATGTAGAAT TGATTCATCC 800 

30 TGTTGCTATT GAAAACGGGA CGACTTTCTC CA 832 



2) INFORMATION FOR SEQ ID NO: 988 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 "(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Enterococcus mundtii 

(B) STRAIN: ATCC 43186 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 988 

50 TGGTGCGATC TTAGTTGTTT CTGCGACAGA TGGACCAATG CCTCAAACTC 50 

GCGAGCATAT CCTTTTATCA CGTCAAGTTG GTGTAAAATA TTTGATTGTA 100 

TTTTTGAATA AAGTTGATTT GGTCGATGAT GAAGAATTGA TCGATCTTGT 150 

AGAAATGGAA GTTCGTGAAT TACTGAATGA ATATGGTTTC CCAGGTGATG 2 00 

ACACACCTGT CATCAAAGGC TCTGCATTGA AAGCATTGCA AGGTGACCCA 2 50 

55 GAAGCAGAAG CTGCGATCAA TGAATTGATG GAAACAGTGG ATGACTATAT 3 00 

CCCAACACCA GAACGTGATA CCGACAAACC ATTGCTTTTA CCAGTTGAAG 350 

ATGTTTTCTC AATCACTGGT CGCGGAACGG TAGCATCTGG TCGTATCGAC 400 

CGTGGAGCCG TTCGTGTCGG TGATGAAATC GAGATCATCG GAATCAAACC 450 

TGAAACGAAA AAAGCGGTTG TGACAGGGGT AGAGATGTTC CGTAAAACGT 500 

60 TAGATTATGG CGAAGCTGGA GATAACGTAG GAATCTTGTT ACGTGGTATC 550 

550 
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CAAAGAGAAG ATATTGAACG TGGACAAGTA 
CACACCACAT ACAAAATTCA AAGCGGAAGT 
AAGGCGGACG TCATACACCA TTCTTCAATA 
TTCCGCACAA CAGATGTAAC AGGTACGATC 
AATGGTCATG CCTGGAGACA ACGTAACCAT 
CAGTGGCAAT CGAACAAGGA ACGACTTTCT 



ATTGCGAAAC CTGGTTCAAT 
TTATGTATTG ACGAAAGAAG 
ACTACCGCCC ACAATTTTAT 
GTGTTGCCAG AAGGAACAGA 
CGAGGTAGAG TTGATCCATC 
CTATT 



600 
650 
700 
750 
800 
835 



2) INFORMATION FOR SEQ ID NO: 989 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 832 bases 
<B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus raffinosus 

(B) STRAIN: ATCC 49427 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 989 

TGCGATCTTA GTAGTATCTA CTACTGATGG ACCAATGCCT CAAACACGTG 50 

AGCACATTTT GTTATCACGC CAAGTAGGTG TTAAATACTT GATCGTCTTC 100 

TTGAACAAAG TTGATTTAGT CGACGATGAA GAATTGATTG ATTTAGTTGA 150 

AATGGAAGTA CGTGAGTTAC TTTCAGAATA TGGTTTCCCA GGCGATGATA 200 

TTCCTGTTCT TAAAGGTTCA GCTCTGAAAG CTTTAGAAGG CGATCCTGAA 250 

CAAGAACAAG TAATCATGGA CCTAATGGAT ACGGTTGACG AATATATCCC 3 00 

AACAC C AGAA CGTGACACTG ACAAACCATT CTTGTTACCA GTGGAAGATG 350 

TTTTCTCAAT CACAGGACGT GGGACTGTTG CATCTGGTCG TATTGATCGT 400 

GGGGAAGTTA AAGTCGGTGA CGAAGTTGAA ATTATCGGGA TCAAACCTGA 450 

AGTTCAAAAG GCTGTCGTAA CTGGACTTGA AATGTTCCGT AAAACATTGG 500 

ATTATGGTGA AGCTGGAGAT AACGTTGGGG TTCTATTACG TGGTATTACA 550 

CGTGATGAAA TCGAACGTGG TCAAGTATTA GCTAAACCAG GTTCAATTAC 600 

ACCACATACG AAATTCAGTG CAGAAGTTTA TGTGTTGACG AAAGAAGAAG 650 

GTGGACGTCA TACACCATTC TTTAACAACT ATCGTCCTCA ATTCTACTTC 700 

CGTACAACAG ACGTTACCGG TAATATCGTG TTGCCAGAAG GTACTGAAAT 750 

GGTCATGCCT GGCGATAACG TAACAATCGA CGTTGAATTA ATCCATCCAA 800 

TCGCCGTAGA AAAAGGAACA ACTTTCTCTA TT 832 



2) INFORMATION FOR SEQ ID NO: 990 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9440 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 990 
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GGATCCTGTA TATGCACAAA AACTAGGTGT 
TATCACAACC TGATACAGGG GAGCAAGGTT 
GTACGAAGTG GTGCGGTTGA TATTATCGTA 
5 TGTA 



TAACATCGAT GAATTACTAT 50 
TAGAAATCGC AGAAGCACTT 100 
ATTGACTCTG TAGCAGCTCT 150 

154 



2) INFORMATION FOR SEQ ID NO: 991 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 384 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Prevotella melaninogenica 

(B) STRAIN: ATCC 25845 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 991 

25 GCCATTGCAG AGGCACAGAA GCAGGGCGGT ATTGCAGCCT TCATTGATGC 50 

TGAGCACGCC TTCGACCGTT TCTATGCAGA GAAGTTAGGT GTGGATGTTG 100 

ATAACCTTTG GGTTTCACAG CCAGACAATG GTGAGCAGGC TTTAGAGATT 150 

GCCGACCAGC TGATTCGCTC TTCCGCTATT GACATTCTCG TTGTCGACTC 2 00 

AGTTGCAGCC TTGACTCCAA AGAAGGAGAT TGAGGGTGAC ATGGGTGACT 2 50 

3 0 CTGCAGTAGG TTTACAAGCA CGACTGATGA GTCAGGCATT GCGTAAACTT 3 00 

ACCTCAACAA TCGCAAAAAC TAATACTTGC TGCATCTTCA TCAACCAGTT 350 

GCGTGAGAAG ATTGGTGTGA TGTTTGGTAA TCCA 384 



35 



45 



2) INFORMATION FOR SEQ ID NO: 992 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 624 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: R760 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 992 

ACACGTGAAC ACATCTTGTT ATCACGTAAC GTTGGTGTAC CATACATCGT 50 

TGTTTTCTTA AACAAAATGG ATATGGTTGA TGACGAAGAA TTACTAGAAT 100 

TAGTTGAAAT GGAAGTTCGT GACTTATTGT CAGAATATGA CTTCCCAGGC 150 

55 GACGATGTTC CTGTAATCGC TGGTTCTGCT TTGAAAGCTC TTGAAGGCGA 200 

TGCTTCATAC GAAGAAAAAA TCATGGAATT AATGGCTGCA GTTGACGAAT 250 

ACGTTCCAAC TCCAGAACGT GACACTGACA AACCATTCAT GATGCCAGTC 3 00 

GAAGACGTAT TCTCAATCAC TGGACGTGGT ACTGTTGCTA CAGGCCGTGT 350 

TGAACGTGGA CAAGTTCGCG TTGGTGACGA AGTTGAAATC GTTGGTATTG 4 00 

60 CTGAAGAAAC TGCTAAAACA ACTGTAACTG GTGTTGAAAT GTTCCGTAAA 450 

552 
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TTGTTAGACT ATGCTGAAGC AGGGGATAAC ATTGGTGCAT TGCTACGTGG 

TGTTGCTCGT GAAGACATCC AACGTGGACA AGTATTGGCT AAAGCTGGTA 

CAATCACACC TCATACAAAA TTTAAAGCTG AAGTTTACGT TTTAACAAAA 
GAAGAAGGTG GACGTCATAC ACCA 



500 
550 
600 
624 
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2) INFORMATION FOR SEQ ID NO: 993 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 756 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 

(C) ACCESSION NUMBER: extracted from U40453 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 993 

ATGGAAAACA ATAAAAAAGT ATTGAAGAAA ATGGTATTTT TTGTTTTAGT 50 

GACATTTCTT GGACTAACAA TCTCGCAAGA GGTATTTGCT CAACAAGACC 100 

CCGATCCAAG CCAACTTCAC AGATCTAGTT TAGTTAAAAA CCTTCAAAAT 150 

ATATATTTTC TTTATGAGGG TGACCCTGTT ACTCACGAGA ATGTGAAATC 2 00 

TGTTGATCAA CTTTTATCTC ACGATTTAAT ATATAATGTT TCAGGGCCAA 250 

ATTATGATAA ATTAAAAACT GAACTTAAGA ACCAAGAGAT GGCAACTTTA 3 00 

TTTAAGGATA AAAACGTTGA TATTTATGGT GTAGAATATT ACCATCTCTG 350 

TTATTTATGT GAAAATGCAG AAAGGAGTGC ATGTATCTAC GGAGGGGTAA 400 

CAAATCATGA AGGGAATCAT TTAGAAATTC CTAAAAAGAT AGTCGTTAAA 450 

GTATCAATCG ATGGTATCCA AAGCCTATCA TTTGATATTG AAACAAATAA 500 

AAAAATGGTA ACTGCTCAAG AATTAGACTA TAAAGTTAGA AAAT AT CTTA 550 

CAGATAATAA GCAACTATAT ACTAATGGAC CTTCTAAATA TGAAACTGGA 600 

TATATAAAGT TCATACCTAA GAATAAAGAA AGTTTTTGGT TTGATTTTTT 650 

CCCTGAACCA GAATTTACTC AATCTAAATA TCTTATGATA TATAAAGATA 700 

ATGAAACGCT TGACTCAAAC ACAAGCCAAA TTGAAGTCTA CCTAACAACC 750 

AAGTAA 756 



2) INFORMATION FOR SEQ ID NO: 994 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 994 
TGGACTAACA ATCTCGCAAG AGG 23 



2) INFORMATION FOR SEQ ID NO: 995 



553 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 995 
ACATTCTCGT GAGTAACAGG GT 22 



10 



15 2) INFORMATION FOR SEQ ID NO: 996 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 996 

ACAAATCATG AAGGGAATCA TTTAG 25 



30 



40 



45 



60 



2) INFORMATION FOR SEQ ID NO: 997 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 997 
CTAATTCTTG AGCAGTTACC ATT 23 



2) INFORMATION FOR SEQ ID NO: 998 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 998 
GGAGGGGTAA CAAATCATGA AGG 23 



554 
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2) INFORMATION FOR SEQ ID NO: 999 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 999 
15 TTGACCTTGT TGATGACGAA GAG 23 



2) INFORMATION FOR SEQ ID NO: 1000 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100 0 

30 

TTAGTGTGTG GGTTGATTGA ACT 23 



35 2) INFORMATION FOR SEQ ID NO: 1001 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1001 

AAGAGTTGCT TGAATTAGTT GAG 23 



50 

2) INFORMATION FOR SEQ ID NO: 1002 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 894 bases 
55. (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



60 



(ii) MOLECULE TYPE: Genomic DNA 



555 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 

(B) STRAIN: ATCC 7002 94 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1002 

AACATGATCA CTGGTGCCGC TCAAATGGAC GGAGCTATCC TTGTAGTTGC 50 

TTCAACTGAT GGACCAATGC CACAAACTCG TGAGCACATC CTTCTTTCAC 100 

GTCAGGTTGG TGTTAAACAC CTTATCGTGT TCATGAACAA AGTTGACCTT 150 

10 GTTGATGACG AAGAGTTGCT TGAATTAGTT GAGATGGAAA TTCGTGACCT 2 00 

TCTTTCAGAA TACGATTTCC CAGGTGATGA CCTTCCAGTT ATCCAAGGTT 250 

CAGCTCTTAA AGCTCTTGAA GGCGACACTA AATTTGAAGA CATCATCATG 3 00 

GAATTGATGG ATACTGTTGA TTCATACATT CCAGAACCAG AACGCGACAC 350 

TGACAAACCA TTGCTTCTTC CAGTCGAAGA CGTATTCTCA ATTACAGGTC .4 00 

15 GTGGTACAGT TGCTTCAGGA CGTATCGACC GTGGTACTGT TCGTGTCAAC 450 

GACGAAATCG AAATCGTTGG TATCAAAGAA GAAACTAAAA AAGCTGTTGT 500 

TACTGGTGTT GAAATGTTCC GTAAACAACT TGACGAAGGT CTTGCAGGAG 550 

ACAACGTAGG TATCCTTCTT CGTGGTGTTC AACGTGACGA AATCGAACGT 600 

GGTCAAGTTA TTGCTAAACC AAGTTCAATC AACCCACACA CTAAATTCAA 650 

20 AGGTGAAGTA TATATCCTTT CTAAAGACGA AGGTGGACGT CACACTCCAT 700 

TCTTCAACAA CTACCGTCCA CAATTCTACT TCCGTACAAC TGACGTAACA 750 

GGTTCAATCG AACTTCCAGC AGGTACAGAA ATGGTTATGC CTGGTGATAA 800 

CGTGACAATC AACGTTGAGT TGATCCACCC AATCGCCGTA GAACAAGGTA 850 

CTACTTTCTC AATCCGTGAA GGTGGACGTA CTGTTGGTTC AGGT 894 

25 



2) INFORMATION FOR SEQ ID NO: 1003 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 
40 <B) STRAIN: ATCC 14579 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1003 

AATGGATCCT GTATATGCAC AAAAATTAGG CGTTAACATA GATGAATTAC 50 

45 TATTATCACA GCCTGATACA GGGGAGCAAG GATTAGAAAT CGCGGAAGCA 100 

CTTGTACGAA GTGGTGCGGT TGACATTATC GTAATTGACT CTGTAGCAGC 150 

TCTTGTACCG AAAGCAGAGA TTGAAGGCGA CATGGGTGAC TCACACGTAG 2 00 

GTTTACAAGC ACGTTTAATG TCACAAGCAC TTCGTAAGCT TTCAGGAGCA 250 

ATCAACAAAT CAAAAACAAT TGCAATCTTT ATTAACCAAA TTCGTGAAAA 3 00 

50 AGTTGGGGTT ATGTTCGGAA ACCCAGAAAC AA 3 32 



2) INFORMATION FOR SEQ ID NO: 1004 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1212 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 

556 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-01 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1004 

10 ACCAAGAAGC TCAAAAACAT CTGTGGGATA TCTACAACTC CGATCAATAC 50 

GTCTCTTACC CTGACGATGA TTTGCAAGTC GCATCTACGG TCGTAGATGT 100 

TTCAAATGGT AAAGTCATCG CCCAACTTGG AGCTCGTCAC CAAGCAAGTA 150 

ACGTTTCATT TGGTACCAAC CAAGCTGTGG AAACCAATCG TGACTGGGGT 200 

TCTGCTATGA AACCAATCAC CGATTATGCA CCTGCCATAG AATACGGTGT 250 

15 TTATGATTCC ACTGCAACTA TGGTTAATGA TATTCCTTAT AACTATCCGG 300 

GAACAAGCAC ACCTGTCTAC AACTGGGATA GAGCATATTT CGGTAATATT 350 

ACTCTGCAAT ATGCTCTTCA ACAATCACGA AATGTCACAG CCGTTGAGAC 400 

TTTGAATAAG GTCGGTCTAG ATAGAGCTAA AACCTTCCTT AATGGTCTTG 450 

GTATCGACTA TCCAAGCATG CATTATGCAA ACGCCATTTC AAGTAATACA 500 

20 ACAGAATCTA ATAAACAATA CGGAGCAAGT AGTGAAAAAA TGGCTGCTGC 550 

TTATGCTGCC TTTGCAAATG GTGGCACTTA CTATAAACCA ATGTATATCC 600 

ATAAAGTCGT CTTCAGTGAT GGAAGTAAAA AAGAGTTCTC TAATGTCGGA 650 

ACTCGTGCCA TGAAGGAAAC GACAGCCTAT ATGATGACCG ACATGATGAA 700 

AACAGTCTTG ACTTATGGAA CTGGGCGTGG AGCCTATCTT CCTTGGCTTC 750 

25 CTCAAGCTGG TAAAACAGGA ACCTCTAACT ATACAGATGA GGAAGTTGAA 800 

AACCACATCA AGAACACTGG CTATGTAGCT CCAGATGAAA TGTTTGTTGG 850 

TTATACTCGT AAGTATT CTA TGGCTGTATG GACAGGTTAT TCGAATCGTT 900 

TAACTCCTAT CGTTGGAGAT GGTTTCCTAG TTGCAGCTAA AGTTTATCGC 950 

TCAATGATAA CGTATCTATC AGAAGATACT CATC C AGAAG ACTGGACGAT 1000 

3 0 GCCAGACGGA CTTTTCAGAA ACGGGGAATT TGTATTCAAA AATGGAGCTC 1050 

GCCCAATATG GACTGAACCC TCTACTCAAC AATCCTCAAC AGCTGAAAGT 1100 

TCAAGCTCAT CATCAGATAG TTCAACTTCA CAGTCTAGCT CAACCACTCC 1150 

AAGCACAAAT AATAGTACGA CTACCAATCC TAACAATAAT ACGCAACAAT 1200 

CAAATACAAC CC 1212 

35 



2) INFORMATION FOR SEQ ID NO: 1005 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1212 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
50 (B) STRAIN: StrR-02 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1005 

AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA ATACGTCTCT 50 

55 TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG ATGTTTCAAA 100 

TGGTAAAGTC ATCGCCCAAC TTGGAGCTCG TCACCAAGCA AGTAACGTTT 150 

CATTTGGTAC CAACCAAGCT GTGGAAACCA ATCGTGACTG GGGTTCTGCT 200 

ATGAAACCAA TCACCGATTA TGCACCTGCC ATAGAATACG GTGTTTATGA 250 

TTCCACTGCA ACTATGGTTA ATGATATTCC TTATAACTAT CCGGGAACAA 300 

60 GCACACCTGT CTACAACTGG GATAGAGCAT ATTTCGGTAA TATTACTCTG 350 

557 
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CAATATGCTC TTCAACAATC ACGAAATGTC ACAGCCGTTG AGACTTTGAA 400 

TAAGGTCGGT CTAGATAGAG CTAAAACCTT CCTTAATGGT CTTGGTATCG 450 

ACTATCCAAG CATGCATTAT GCAAACGCCA TTTCAAGTAA TACAACAGAA 500 

TCTAATAAAC AATACGGAGC AAGTAGTGAA AAAATGGCTG CTGCTTATGC 550 

5 TGCCTTTGCC AACGGTGGAA TTTACCACAA ACCCATGTAT ATCAATAAGG 600 

TCGTCTTCAG TGACGGTAGT AAAAAAGAAT TTTCAGATGT AGGTACACGA 650 

GCTATGAAAG AAACAACTGC TTACATGATG ACCGAAATGA TGAAAACTGT 700 

CTTGGCATAC GGAACTGGTC GTGGAGCCTA TCTCCCATGG TTAGCGCAAG 750 

CTGGTAAGAC AGGTACTTCT AACTACACAG ATGATGAAAT TGAAAAACAC 800 

10 ATCAAGAACA CTGGCTATGT AGCTCCAGAT GAAATGTTTG TTGGTTATAC 850 

TCGTAAGTAT TCTATGGCTG TATGGACAGG TTATTCGAAT CGTTTAACTC 900 

CTATCGTTGG AGATGGTTTC CTAGTTGCAG CTAAAGTTTA TCGCTCAATG 950 

ATAACGTATC TGTCTGAAGG AAGCAATCCA GAGGATTGGA ATATACCAGA 1000 

GGGGCTCTAC AGAAATGGAG AATTCGTATT TAAAAATGGT GCTCGTTCTA 1050 

15 CGTGGAGCTC ACCTGCTCCA CAACAACCCC CATCAACTGA AAGTTCAAGC 1100 

TCATCATCAG ATAGTTCAAC TTCACAGTCT AACTCAACCA CTCCAAGCAC 1150 

AAATAATAGT ACGACTACCA ATCCTAACAA TAATACGCAA CAATCAAATA 1200 

CAACCCCTGA TC 1212 



20 



35 



2) INFORMATION FOR SEQ ID NO: 1006 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 1213 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

3 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-03 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1006 



ACCAAGAAGC TCAAAAACAT CTGTGGGATA TCTACAACTC CGATCAATAC 50 

GTCTCTTACC CTGACGATGA TTTGCAAGTC GCATCTACGG TCGTAGATGT 100 

40 TTCAAATGGT AAAGTCATCG CCCAACTTGG AGCTCGTCAC CAAGCAAGTA 150 

ACGTTTCATT TGGTACCAAC CAAGCTGTGG AAACCAATCG TGACTGGGGT 2 00 

TCTGCTATGA AACCAATCAC CGATTATGCA CCTGCCATAG AATACGGTGT 250 

TTATGATTCC ACTGCAACTA TGGTTAATGA TATTCCTTAT AACTATCCGG 300 

GAACAAGCAC ACCTGTCTAC AACTGGGATA GAGCATATTT CGGTAATATT 350 

45 ACTCTGCAAT ATGCTCTTCA ACAATCACGA AATGTCACAG CCGTTGAGAC 400 

TTTGAATAAG GTCGGTCTAG ATAGAGCTAA AACCTTCCTT AATGGTCTTG 450 

GTATCGACTA TCCAAGCATG CATTATGCAA ACGCCATTTC AAGTAATACA 500 

ACAGAATCTA ATAAACAATA CGGAGCAAGT AGTGAAAAAA TGGCTGCTGC 550 

TTATGCTGCC TTTGCCAACG GTGGAATTTA CCACAAACCC ATGTATATCA 600 

50 ATAAGGTCGT CTTCAGTGAC GGTAGTAAAA AAGAATTTTC AGATGTAGGT 650 

ACACGAGCTA TGAAAGAAAC AACTGCTTAC ATGATGACCG AAATGATGAA 700 

AACTGTCTTG GCATACGGAA CTGGTCGTGG AGCCTATCTC CCATGGTTAG 750 

CGCAAGCTGG TAAGACAGGT ACTTCTAACT ACACAGATGA TGAAATTGAA 800 

AAACACATCA AGAACACTGG CTATGTAGCT CCAGATGAAA TGTTTGTTGG 850 

55 TTATACTCGT AAGTATTCTA TGGCTGTATG GACAGGTTAT TCGAATCGTT 900 

TAACTCCTAT CGTTGGAGAT GGTTTCCTAG TTGCAGCTAA AGTTTATCGC 950 

TCAATGATAA CGTATCTGTC TGAAGGAAGC AATCCAGAGG ATTGGAATAT 1000 

ACCAGAGGGG CTCTACAGAA ATGGAGAATT CGTATTTAAA AATGGTGCTC 1050 

GTTCTACGTG GAGCTCACCT GCTCCACAAC AACCCCCATC AACTGAAAGT 1100 

60 TCAAGCTCAT CATCAGATAG TTCAACTTCA CAGTCTAACT CAACCACTCC 1150 

558 
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AAGCACAAAT AATAGTACGA CTACCAATCC TAACAATAAT ACGCAACAAT 1200 
CAAATACAAC CCC 1213 



2) INFORMATION FOR SEQ ID NO: 1007 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1218 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-04 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1007 

ACCAAGAAGC TCAAAAACAT CTGTGGGATA TCTACAACTC CGATCAATAC 50 

GTCTCTTACC CTGACGATGA TTTGCAAGTC GCATCTACGG TCGTAGATGT 100 

TTCAAATGGT AAAGTCATCG CCCAACTTGG AGCTCGTCAC CAAGCAAGTA 150 

25 ACGTTTCATT TGGTACCAAC CAAGCTGTGG AAACCAATCG TGACTGGGGT 200 

TCTGCTATGA AACCAATCAC CGATTATGCA CCTGCCATAG AATACGGTGT 250 

TTATGATTCC ACTGCAACTA TGGTTAATGA TATTCCTTAT AACTATCCGG 3 00 

GAACAAGCAC ACCTGTCTAC AACTGGGATA GAGCATATTT CGGTAATATT 3 50 

ACTCTGCAAT ATGCTCTTCA ACAATCACGA AATGTCACAG CCGTTGAGAC 400 

30 TTTGAATAAG GTCGGTCTAG ATAGAGCTAA AACCTTCCTT AATGGTCTTG 450 

GTATCGACTA TCCAAGCATG CATTATGCAA ACGCCATTTC AAGTAATACA 500 

ACAGAATCTA ATAAACAATA CGGAGCAAGT AGTGAAAAAA TGGCTGCTGC 550 

TTATGCTGCC TTTGCAAATG GTGGCACTTA CTATAAACCA ATGTATATCC 600 

ATAAAGTCGT CTTCAGTGAT GGAAGTAAAA AAGAGTTCTC TAATGTCGGA 650 

35 ACTCGTGCCA TGAAGGAAAC GACAGCCTAT ATGATGACCG ACATGATGAA 700 

AACAGTCTTG ACTTATGGAA CTGGGCGTGG AGCCTATCTT CCTTGGCTTC 750 

CTCAAGCTGG TAAAACAGGA ACCTCTAACT ATACAGATGA GGAAGTTGAA 800 

AACCACATCA AGAACACTGG CTATGTAGCT CCAGATGAAA TGTTTGTTGG 850 

TTATACTCGT AAGTATTCTA TGGCTGTATG GACAGGTTAT TCGAATCGTT 900 

40 TAACTCCTAT CGTTGGAGAT GGTTTCCTAG TTGCAGCTAA AGTTTATCGC 950 

TCAATGATAA CGTATCTATC AGAAGATACT CATCCAGAAG ACTGGACGAT 1000 

GCCAGACGGA CTTTTCAGAA ACGGGGAATT TGTATTCAAA AATGGAGCTC 1050 

GCCCAATATG GACTGAACCC TCTACTCAAC AATCCTCAAC AGCTGAAAGT 1100 

TCAAGCTCAT CATCAGATAG TTCAACTTCA CAGTCTAGCT CAACCACTCC 1150 

45 AAGCACAAAT AATAGTACGA CTACCAATCC TAACAATAAT ACGCAACAAT 12 00 

CAAATACAAC CCCTGATC 1218 



50 2) INFORMATION FOR SEQ ID NO: 1008 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1223 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

559 
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(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1008 

5 

GTAGACCAAG AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA 50 

ATACGTCTCT TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG 100 

ATGTTTCAAA TGGTAAAGTC ATCGCCCAAC TTGGAGCTCG TCACCAAGCA 150 

AGTAACGTTT CATTTGGTAC CAACCAAGCT GTGGAAACCA ATCGTGACTG 200 

10 GGGTTCTGCT ATGAAACCAA TCACCGATTA TGCACCTGCC ATAGAATACG 250 

GTGTTTATGA TTCCACTGCA ACTATGGTTA ATGATATTCC TTATAACTAT 300 

CCGGGAACAA GCACACCTGT CTACAACTGG GATAGAGCAT ATTTCGGTAA 350 

TATTACTCTG CAATATGCTC TTCAACAATC ACGAAATGTC ACAGCCGTTG 400 

AGACTTTGAA TAAGGTCGGT CTAGATAGAG CTAAAACCTT CCTTAATGGT 450 

15 CTTGGTATCG ACTATCCAAG CATGCATTAT GCAAACGCCA TTTCAAGTAA 500 

TACAACAGAA TCTAATAAAC AATACGGAGC AAGTAGTGAA AAAATGGCTG 550 

CTGCTTATGC TGCCTTTGCA AATGGTGGCA CTTACTATAA ACCAATGTAT 600 

ATCCATAAAG TCGTCTTCAG TGATGGAAGT AAAAAAGAGT TCTCTAATGT 650 

CGGAACTCGT GCCATGAAGG AAACGACAGC CTATATGATG ACCGACATGA 700 

2 0 TGAAAACAGT CTTGACTTAT GGAACTGGGC GTGGAGCCTA TCTTCCTTGG 750 

CTTCCTCAAG CTGGTAAAAC AGGAACCTCT AACTATACAG ATGAGGAAGT 800 

TGAAAACCAC ATCAAGAACA CTGGCTATGT AGCTCCAGAT GAAATGTTTG 850 

TTGGTTATAC TCGTAAGTAT TCTATGGCTG TATGGACAGG TTATTCGAAT 900 

CGTTTAACTC CTATCGTTGG AGATGGTTTC CTAGTTGCAG CTAAAGTTTA 950 

25 TCGCTCAATG ATAACGTATC TATCAGAAGA TACTCATCCA GAAGACTGGA 1000 

CGATGCCAGA CGGACTTTTC AGAAACGGGG AATTTGTATT CAAAAATGGA 1050 

GCTCGCCCAA TATGGACTGA ACCCTCTACT CAACAATCCT CAACAGCTGA 1100 

AAGTTCAAGC TCATCATCAG ATAGTTCAAC TTCACAGTCT AG CTCAAC CA 1150 

CTCCAAGCAC AAATAATAGT ACGACTACCA ATCCTAACAA TAATACGCAA 1200 

3 0 CAATCAAATA CAACCCCTGA TCA 1223 



2) INFORMATION FOR SEQ ID NO: 1009 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1214 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1009 

50 ACCAAGAAGC TCAAAAACAT CTGTGGGATA TCTACAACTC CGATCAATAC 50 

GTCTCTTACC CTGACGATGA TTTGCAAGTC GCATCTACGG TCGTAGATGT 100 

TTCAAATGGT AAAGTCATCG CCCAACTTGG AGCTCGTCAC CAAGCAAGTA 150 

ACGTTTCATT TGGTACCAAC CAAGCTGTGG AAACCAATCG TGACTGGGGT 200 

TCTGCTATGA AACCAATCAC CGATTATGCA CCTGCCATAG AATACGGTGT 2 50 

55 TTATGATTCC ACTGCAACTA TGGTTAATGA TATTCCTTAT AACTATCCGG 300 

GAACAAGCAC ACCTGTCTAC AACTGGGATA GAGCATATTT CGGTAATATT 350 

ACTCTGCAAT ATGCTCTTCA ACAATCACGA AATGTCACAG CCGTTGAGAC 400 

TTTGAATAAG GTCGGTCTAG ATAGAGCTAA AACCTTCCTT AATGGTCTTG 450 

GTATCGACTA TCCAAGCATG CATTATGCAA ACGCCATTTC AAGTAATACA 500 

60 ACAGAATCTA ATAAACAATA CGGAGCAAGT AGTGAAAAAA TGGCTGCTGC 550 

560 



SUBSTITUTE SHEET (RULE 26) 
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TTATGCTGCC TTTGCAAATG GTGGCACTTA 
ATAAAGTCGT CTTCAGTGAT GGAAGTAAAA 
ACTCGTGCCA TGAAGGAAAC GACAGCCTAT 
AACAGTCTTG ACTTATGGAA CTGGGCGTGG 
5 CTCAAGCTGG TAAAACAGGA ACCTCTAACT 
AACCACATCA AGAACACTGG CTATGTAGCT 
TTATACTCGT AAGTATTCTA TGGCTGTATG 
TAACTCCTAT CGTTGGAGAT GGTTTCCTAG 
TCAATGATAA CGTATCTATC AGAAGATACT 
10 GCCAGACGGA CTTTTCAGAA ACGGGGAATT 
GCCCAATATG GACTGAACCC TCTACTCAAC 
TCAAGCTCAT CATCAGATAG TTCAACTTCA 
AAGCACAAAT AATAGTACGA CTACCAATCC 
CAAATACAAC CCCT 

15 



PCT/CAOO/01150 

CTATAAACCA ATGTATATCC 600 

AAGAGTTCTC TAATGTCGGA 650 

ATGATGACCG ACATGATGAA 700 

AGCCTATCTT CCTTGGCTTC 750 

ATACAGATGA GGAAGTTGAA 800 

CCAGATGAAA TGTTTGTTGG 850 

GACAGGTTAT TCGAATCGTT 900 

TTGCAGCTAA AGTTTATCGC 950 

CATCCAGAAG ACTGGACGAT 1000 

TGTATTCAAA AATGGAGCTC 1050 

AATCCTCAAC AGCTGAAAGT 1100 

CAGTCTAGCT CAACCACTCC 1150 

TAACAATAAT ACGCAACAAT 1200 

1214 



2) INFORMATION FOR SEQ ID NO: 1010 

20 <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1223 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



60 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
30 (B) STRAIN: StrR-07 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1010 

CCAAGAAGCT CAAAAACATC TGTGGGATAT CTACAACTCC GATCAATACG 50 

35 . TCTCTTACCC TGACGATGAT TTGCAAGTCG CATCTACGGT CGTAGATGTT 100 

TCAAATGGTA AAGTCATCGC ACAACTTGGT GCTCGTCATC AAGCAAGTAA 150 

TGTTTCATTC GGTACCAACC AGGCCGTAGA AACCAATCGT GACTGGGGAT 200 

CATCAATGAA ACCAATCACT GACTATGCTC CCGCTTTAGA ATATGGAGTC 250 

TATGACTCTA CTGCTTCTAT TGTACATGAT GTCCCTTATA ACTATCCTGG 300 

40 CACTGATACT CCACTCTACA ACTGGGATCA TGTCTACTTT GGAAACATTA 3 50 

CAATCCAGTA TGCTCTTCAA CAATCACGAA ATGTCACAGC CGTTGAGACT 4 00 

TTGAATAAGG TCGGTCTAGA TAGAGCTAAA ACCTTCCTTA ATGGTCTTGG 450 

TATCGACTAT CCAAGCATGC ATTATGCAAA CGCCATTTCA AGTAACACAA 500 

CTGAATCCAA CAAAAAATAT GGTGCAAGTA GTGAAAAAAT GGCTGCTGCC 550 

45 TACGCTGCTT TTGCTAATGG TGGTATTTAT CACAAACCAA TGTATATCAA 600 

TAAAATCGTC TTTAGTGATG GTAGCGAAAA AGAATTTTCT GATGCTGGTA 650 

CACGAGCTAT GAAAGAGACT ACTGCCTATA TGATGACTGA AATGATGAAA 700 

ACTGTTTTAA CTTACGGAAC AGGACGTGGA GCCTACCTAC CATGGCTTCC 750 

ACAAGCAGGT AAGACAGGTA CTTCTAACTA TACTGACGAA GAAATTGAAA 800 

50 AGTATATCAA GAACACTGGT TACGTAGCTC CAGATGAAAT GTTTGTAGGG 850 

TATACCCGTA AATATGCAAT GGCTGTTTGG ACAGGATACT CAAATCGTCT 900 

AACTCCAATC ATCGGAGATG GTTTCCTTGT TGCTGGTAAA GTCTATCGTT 950 

CAATGATAAC TTACCTTTCT GAAGATGACC AACCTGGAGA TTGGACAATG 1000 

CCAGATGGCT TGTATAGAAA TGGAGAATTC GTATTTAAAA ATGGTGCTCG 1050 

55 TTCTACGTGG AGCTCACCTG CTCCACAACA ACCCCCATCA ACTGAAAGTT 1100 

CAAGCTCATC ATCAGATAGT TCAACTTCAC AGTCTAACTC AACCACTCCA 1150 

AGCACAAATA ATAGTACGAC TACCAATCCT AACAATAATA CGCAACAATC 12 00 

AAATACAACC CCTGATCAAC AAA 1223 



561 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



2) INFORMATION FOR SEQ ID NO: 1011 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 07 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-08 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1011 

AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA ATACGTCTCT 50 

TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG ATGTTTCAAA 100 

TGGTAAAGTC ATCGCCCAAC TTGGAGCTCG TCACCAAGCA AGTAACGTTT 150 

CATTTGGTAC CAACCAAGCT GTGGAAACCA ATCGTGACTG GGGTTCTGCT 200 

ATGAAACCAA TCACCGATTA TGCACCTGCC ATAGAATACG GTGTTTATGA 250 

TTCCACTGCA ACTATGGTTA ATGATATTCC TTATAACTAT CCGGGAACAA 300 

GCACACCTGT CTACAACTGG GATAGAGCAT ATTTCGGTAA TATTACTCTG 350 

CAATATGCTC TTCAACAATC ACGAAATGTC ACAGCCGTTG AGACTTTGAA 4 00 

TAAGGTCGGT CTAGATAGAG CTAAAACCTT CCTTAATGGT CTTGGTATCG 450 

ACTATCCAAG CATGCATTAT GCAAACGCCA TTTCAAGTAA TACAACAGAA 500 

TCTAATAAAC AATACGGAGC AAGTAGTGAA AAAATGGCTG CTGCTTATGC 550 

TGCCTTTGCA AATGGTGGCA CTTACTATAA ACCAATGTAT ATCCATAAAG 600 

TCGTCTTCAG TGATGGAAGT AAAAAAGAGT TCTCTAATGT CGGAACTCGT 650 

GCCATGAAGG AAACGACAGC CTATATGATG ACCGACATGA TGAAAACAGT 700 

CTTGACTTAT GGAACTGGGC GTGGAGCCTA TCTTCCTTGG CTTCCTCAAG 750 

CTGGTAAAAC AGGAACCTCT AACTATACAG ATGAGGAAGT TGAAAACCAC 800 

ATCAAGAACA CTGGCTATGT AGCTCCAGAT GAAATGTTTG TTGGTTATAC 850 

TCGTAAGTAT TCTATGGCTG TATGGACAGG TTATTCGAAT CGTTTAACTC 900 

CTATCGTTGG AGATGGTTTC CTAGTTGCAG CTAAAGTTTA TCGCTCAATG 950 

ATAACGTATC TATCAGAAGA TACTCATCCA GAAGACTGGA CGATGCCAGA 1000 

CGGACTTTTC AGAAACGGGG AATTTGTATT CAAAAATGGA GCTCGCCCAA 1050 

TATGGACTGA ACCCTCTACT CAACAATCCT CAACAGCTGA AAGTTCAAGC 1100 

TCATCATCAG ATAGTTCAAC TTCACAGTCT AGCTCAACCA CTCCAAGCAC 1150 

AAATAATAGT ACGACTACCA ATCCTAACAA TAATACGCAA CAATCAAATA 1200 

CAACCCC 12 07 



2) INFORMATION FOR SEQ ID NO: 1012 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1201 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-09 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1012 

562 
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PCT/CA00/01150 



AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA ATACGTCTCT 50 

TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG ATGTTTCAAA 100 

TGGTAAAGTC ATCGCCCAAC TTGGAGCTCG TCACCAAGCA AGTAACGTTT 150 

5 CATTTGGTAC CAACCAAGCT GTGGAAACCA ATCGTGACTG GGGTTCTGCT 200 

ATGAAACCAA TCACCGATTA TGCACCTGCC ATAGAATACG GTGTTTATGA 250 

TTCCACTGCA ACTATGGTTA ATGATATTCC TTATAACTAT CCGGGAACAA 300 

GCACACCTGT CTACAACTGG GATAGAGCAT ATTTCGGTAA TATTACTCTG 350 

CAATATGCTC TTCAACAATC ACGAAATGTC ACAGCCGTTG AGACTTTGAA 400 

10 TAAGGTCGGT CTAGATAGAG CTAAAACCTT CCTTAATGGT CTTGGTATCG 450 

ACTATCCAAG CATGCATTAT GCAAACGCCA TTTCAAGTAA TACAACAGAA 500 

TCTAATAAAC AATACGGAGC AAGTAGTGAA AAAATGGCTG CTGCTTATGC 550 

TGCCTTTGCC AACGGTGGAA TTTACCACAA ACCCATGTAT ATCAATAAGG 600 

TCGTCTTCAG TGACGGTAGT AAAAAAGAAT TTTCAGATGT AGGTACACGA 650 

15 GCTATGAAAG AAACAACTGC TTACATGATG ACCGAAATGA TGAAAACTGT 700 

CTTGGCATAC GGAACTGGTC GTGGAGCCTA TCTCCCATGG TTAGCGCAAG 750 

CTGGTAAGAC AGGTACTTCT AACTACACAG ATGATGAAAT TGAAAAACAC 800 

ATCAAGAACA CTGGCTATGT AGCTCCAGAT GAAATGTTTG TTGGTTATAC 850 

TCGTAAGTAT TCTATGGCTG TATGGACAGG TTATTCGAAT CGTTTAACTC 900 

20 CTATCGTTGG AGATGGTTTC CTAGTTGCAG CTAAAGTTTA TCGCTCAATG 950 

ATAACGTATC TATCAGAAGA TACTCATCCA GAAGACTGGA CGATGCCAGA 1000 

CGGACTTTTC AGAAACGGGG AATTTGTATT CAAAAATGGA GCTCGTTCTA 1050 

CGTGGAACTC ACCTGCTCCA CAACAACCCC CATCAACTGA AAGTTCAAGC 1100 

TCATCATCAG ATAGTTCAAC TTCACAGTCT AGCTCAACCA CTCCAAGCAC 1150 

25 AAATAATAGT ACGACTACCG ATCCTAACAA TAATACGCAA CAATCAAATA 1200 

C 1201 



30 2) INFORMATION FOR SEQ ID NO: 1013 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 0 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-10 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1013 



TGTAGACCAA GAAGCTCAAA AACATCTGTG GGATATCTAC AACTCCGATC 50 

AATACGTCTC TTACCCTGAC GATGATTTGC AAGTCGCATC TACGGTCGTA 100 

GATGTTTCAA ATGGTAAAGT CATCGCACAA CTTGGTGCTC GTCATCAAGC 150 

AAGTAATGTT TCATTCGGTA CCAACCAGGC CGTAGAAACC AATCGTGACT 2 00 

50 GGGGATCATC AATGAAACCA ATCACTGACT ATGCTCCCGC TTTAGAATAT 2 50 

GGAGTCTATG ACTCTACTGC TTCTATTGTA CATGATGTCC CTTATAACTA 3 00 

TCCTGGCACT GATACTCCAC TCTACAACTG GGATCATGTC TACTTTGGAA 350 

ACATTACAAT CCAGTATGCT CTTCAACAAT CACGAAATGT CACAGCCGTT 400 

GAGACTTTGA ATAAGGTCGG TCTAGATAGA GCTAAAACCT TCCTTAATGG 450 

55 TCTTGGTATC GACTATCCAA GCATGCATTA TGCAAACGCC ATTTCAAGTA 500 

ACACAACTGA ATCCAACAAA AAATATGGTG CAAGTAGTGA AAAAATGGCT 550 

GCTGCCTACG CTGCTTTTGC TAATGGTGGT ATTTATCACA AACCAATGTA 600 

TATCAATAAA ATCGTCTTTA GTGATGGTAG CGAAAAAGAA TTTTCTGATG 650 

CTGGTACACG AGCTATGAAA GAGACTACTG CCTATATGAT GACTGAAATG 700 

60 ATGAAAACTG TTTTAACTTA CGGAACAGGA CGTGGAGCCT ACCTACCATG 750 

563 
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GCTTCCACAA GCAGGTAAGA CAGGTACTTC 
TTGAAAAGTA TATCAAGAAC ACTGGCTACG 
GTGGGTTATA CTCGTAAGTA TTCTATGGCT 
TCGTTTAACT CCTATCGTTG GAGATGGTTT 
5 ATCGCTCTAT GATGACCTAC CTGTCTGAAG 
AATATACCAG AGGGGCTCTA CAGAAATGGA 
TGCTCGTTCT ACGTGGAGCT CACCTGCTCC 
AAAGTTCAAG CTCATCATCA GATAGTTCAA 
ACTCCAAGCA CAAATAATAG TACGACTACC 
10 ACAATCAAAT ACAACCCCTG 



PCT/CA00/01150 

TAACTATACT GACGAAGAAA 800 

TAGCTCCAGA TGAAATGTTT 850 

GTATGGACAG GTTATTCGAA 900 

CCTAGTTGCA GCTAAAGTTT 950 

GAAGCAATCC AGAGGATTGG 1000 

GAATTCGTAT TTAAAAATGG 1050 

ACAACAACCC CCATCAACTG 1100 

CTTCACAGTC TAGCTCAACC 1150 

AATCCTAACA ATAATACGCA 12 00 

1220 



2) INFORMATION FOR SEQ ID NO: 1014 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1199 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1014 

3 0 CAAAAACATC TGTGGGATAT TTACAATACA GACGAATACG TTGCCTATCC 50 

AGACGATGAA TTGCAAGTCG CTTCTACCAT TGTTGATGTT TCTAACGGTA 100 

AAGTCATTGC CCAGCTAGGA GCACGCCATC AGTCAAGTAA TGTTTCGTTC 150 

GGAATTAACC AAGCAGTAGA AACAAACCGC GACTGGGGAT CAACTATGAA 2 00 

ACCGATCACA GACTATGCTC CTGCCTTGGA GTACGGTGTC TACGATTCAA 250 

35 CTGCTACTAT CGTTCACGAT GAGCCCTATA ACTACCCTGG GACAAATACC 3 00 

CCTGTTTATA ACTGGGATAG GGGCTACTTT GGCAACATCA CCTTGCAATA 3 50 

CGCCCTGCAA CAATCGCGAA ACGTCCCAGC CGTGGAAACT CTAAACAAGG 400 

TCGGACTCAA CCGCGCCAAG ACTTTCCTAA ATGGTCTCGG AATCGACTAC 450 

CCAAGTATTC ACTACTCAAA TGCCATTTCA AGTAACACAA CCGAATCAGA 500 

40 CAAAAAATAT GGAGCAAGTA GTGAAAAGAT GGCTGCTGCT TACGCTGCCT 550 

TTGCAAATGG TGGAACTTAC TATAAACCAA TGTATATCCA TAAAGTCGTC 600 

TTTAGTGATG GGAGTGAAAA AGAGTTCTCT AATGTCGGAA CTCGTGCCAT 650 

GAAGGAAACG ACAGCCTATA TGATGACCGA. CATGATGAAA ACTGTCTTAG 700 

TATACGGAAT CGGACGTGGA GCCTACCTAC CTTGGCTTCC ACAAGCAGGT 750 

45 AAAACAGGTA CTTCTAACTA TACTGACGAA GAAATTGAAA AGTATATCAA 800 

GAACACTGGT TACGTAGCTC CAGATGAAAT GTTTGTAGGG TATACCCGCA 850 

AATATGCAAT GGCTGTATGG ACAGGCTATT CTAACCGTCT GACACCACTT 900 

GTAGGCGATG GCCTTACGGT CGCTGCTAAA GTTTACCGCT CTATGATGAC 950 

CTACCTGTCT GAAGGAAGCA ATCCAGAGGA TTGGAATATA CCAGAGGGGC 1000 

50 TCTACAGAAA TGGAGAATTC GTATTTAAAA ATGGTGCTCG TTCTACGTGG 1050 

AACTCACCTG CTCCACAACA ACCCCCATCA ACTGAAAGTT CAAGCTCATC 1100 

ATCAGATAGT TCAACTTCAC AGTCTAGCTC AACCACTCCA AGCACAAATA 1150 

ATAGTACGAC TACCAATCCT AACAATAATA CGCAACAATC AAATACAAC 1199 



55 



2) INFORMATION FOR SEQ ID NO: 1015 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 1211 bases 



564 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-12 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1015 

AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA ATACGTCTCT 50 

TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG ATGTTTCAAA 100 

TGGTAAAGTC ATCGCCCAAC TTGGAGCTCG TCACCAAGCA AGTAACGTTT 150 

CATTTGGTAC CAACCAAGCT GTGGAAACCA ATCGTGACTG GGGTTCTGCT 200 

ATGAAACCAA TCACCGATTA TGCACCTGCC ATAGAATACG GTGTTTATGA 250 

TTCCACTGCA ACTATGGTTA ATGATATTCC TTATAACTAT CCGGGAACAA 300 

GCACACCTGT CTACAACTGG GATAGAGCAT ATTTCGGTAA TATTACTCTG 350 

CAATATGCTC TTCAACAATC ACGAAATGTC ACAGCCGTTG AGACTTTGAA 400 

TAAGGTCGGT CTAGATAGAG CTAAAACCTT CCTTAATGGT CTTGGTATCG 450 

ACT ATC C AAG CATGCATTAT GCAAACGCCA TTTCAAGTAA TACAACAGAA 500 

TCTAATAAAC AATACGGAGC AAGTAGTGAA AAAATGGCTG CTGCTTATGC 550 

TGCCTTTGCC AACGGTGGAA TTTACCACAA ACCCATGTAT ATCAATAAGG 600 

TCGTCTTCAG TGACGGTAGT AAAAAAGAAT TTTCAGATGT AGGTACACGA 650 

GCTATGAAAG AAACAACTGC TTACATGATG ACCGAAATGA TGAAAACTGT 700 

CTTGGCATAC GGAACTGGTC GTGGAGCCTA TCTCCCATGG TTAGCGCAAG 750 

CTGGTAAGAC AGGTACTTCT AACTACACAG ATGATGAAAT TGAAAAACAC 800 

ATCAAGAACA CTGGCTATGT AGCTCCAGAT GAAATGTTTG TTGGTTATAC 850 

TCGTAAGTAT TCTATGGCTG TATGGACAGG TTATTCGAAT CGTTTAACTC 900 

CTATCGTTGG AGATGGTTTC CTAGTTGCAG CTAAAGTTTA TCGCTCAATG 950 

ATAACGTATC TGTCTGAAGG AAGCAATCCA GAGGATTGGA ATATACCAGA 1000 

GGGGCTCTAC AGAAATGGAG AATTCGTATT TAAAAATGGT GCTCGTTCTA 1050 

CGTGGAGCTC ACCTGCTCCA CAACAACCCC CATCAACTGA AAGTTCAAGC 1100 

TCATCATCAG ATAGTTCAAC TTCACAGTCT AACTCAACCA CTCCAAGCAC 1150 

AAATAATAGT ACGACTACCA ATCCTAACAA TAATACGCAA CAATCAAATA 1200 

CAACCCCTGA T 1211 



2) INFORMATION FOR SEQ ID NO: 1016 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1222 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-13. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1016 

GTAGACCAAG AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA 50 

ATACGTCTCT TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG 100 

ATGTTTCAAA TGGTAAAGTC ATCGCACAAC TTGGTGCTCG TCATCAAGCA 150 

AGTAATGTTT CATTCGGTAC CAACCAGGCC GTAGAAACCA ATCGTGACTG 2 00 

565 
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GGGATCATCA ATGAAACCAA TCACTGACTA 
GAGTCTATGA CTCTACTGCT TCTATTGTAC 
CCTGGCACTG ATACTCCACT CTACAACTGG 
CATTACAATC CAGTATGCTC TTCAACAATC 
5 AGACTTTGAA TAAGGTCGGT CTAGATAGAG 
CTTGGTATCG ACTATCCAAG CATGCATTAT 
CACAACTGAA TCCAACAAAA AATATGGTGC 
CTGCCTACGC TGCTTTTGCT AATGGTGGTA 
ATCAATAAAA TCGTCTTTAG TGATGGTAGC 

10 TGGTACACGA GCTATGAAAG AGACTACTGC 
TGAAAACTGT TTTAACTTAC GGAACAGGAC 
CTTCCACAAG CAGGTAAGAC AGGTACTTCT 
TGAAAAGTAT ATCAAGAACA CTGGCTACGT 
TGGGTTATAC TCGTAAGTAT TCTATGGCTG 

15 CGTTTAACTC CTATCGTTGG AGATGGTTTC 
TCGCTCTATG ATGACCTACC TGTCTGAAGG 
ATATACCAGA GGGGCTCTAC AGAAATGGAG 
GCTCGTTCTA CGTGGAGCTC ACCTGCTCCA 
AAGTTCAAGC TCATCATCAG ATAGTTCAAC 

20 CTCCAAGCAC AAATAATAGT ACGACTACCA 
CAATCAAATA CAACCCCTGA TC 



PCT/CA00/01150 

TGCTCCCGCT TTAGAATATG 250 

ATGATGTCCC TTATAACTAT 300 

GATCATGTCT ACTTTGGAAA 350 

ACGAAATGTC ACAGCCGTTG 400 

CTAAAACCTT CCTTAATGGT 450 

GCAAACGCCA TTTCAAGTAA 500 

AAGTAGTGAA AAAATGGCTG 550 

TTTATCACAA ACCAATGTAT 600 

GAAAAAGAAT TTTCTGATGC 650 

CTATATGATG ACTGAAATGA 700 

GTGGAGCCTA CCTACCATGG 750 

AACTATACTG ACGAAGAAAT 800 

AGCTCCAGAT GAAATGTTTG 850 

TATGGACAGG TTATTCGAAT 900 

CTAGTTGCAG CTAAAGTTTA 950 

AAGCAATCCA GAGGATTGGA 1000 

AATTCGTATT TAAAAATGGT 1050 

CAACAACCCC CATCAACTGA 1100 

TTCACAGTCT AGCTCAACCA 1150 

ATCCTAACAA TAATACGCAA 1200 

1222 



25 2) INFORMATION FOR SEQ ID NO: 1017 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 9 bases 

(B) TYPE: Nucleic acid 
30 (C) STRAND EDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-14 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1017 



TGTAGACCAA GAAGCTCAAA AACATCTGTG GGATATTTAC AATACAGACG 50 

AATACGTTGC CTATCCAGAC GATGAATTGC AAGTCGCTTC TACCATTGTT 100 

GATGTTTCTA ACGGTAAAGT CATTGCCCAG CTAGGAGCAC GCCATCAGTC 150 

AAGTAATGTT TCCTTCGGAA TTAACCAAGC AGTAGAAACA AACCGCGACT 200 

45 GGGGATCAAC TATGAAACCG ATCACAGACT ATGCTCCTGC CTTGGAGTAC 250 

GGTGTCTACG ATTCAACTGC TACTATCGTT CACGATGAGC CCTATAACTA 3 00 

CCCTGGGACA AATACCCCTG TTTATAACTG GGATAGGGGC TACTTTGGCA 350 

ACATCACCTT GCAATACGCC CTGCAACAAT CGCGAAACGT CCCAGCCGTG 4 00 

GAAACTCTAA ACAAGGTCGG ACTCAACCGC GCCAAGACTT TCCTAAATGG 450 

50 TCTCGGAATC GACTACCCAA GTATTCACTA CTCAAATGCC ATTTCAAGTA 500 

ACACAACCGA ATCAGACAAA AAATATGGAG CAAGTAGTGA AAAGATGGCT 550 

GCTGCTTACG CTGCCTTTGC AAATGGTGGA ACTTACTATA AAC CAATGTA 600 

TATCCATAAA GTCGTCTTTA GTGATGGGAG TGAAAAAGAG TTCTCTAATG 650 

TCGGAACTCG TGCCATGAAA GAAACAACTG CTTACATGAT GACCGAAATG 700 

55 ATGAAAACTG TCCTGGCATA CGGAAGTGGT CGTGGAGCCT ATCTCCCATG 750 

GTTAGCGCAA GCTGGTAAGA CAGGTACTTC TAACTACACA GATGATGAAA 800 

TTGAAAAACA CATCAAGAAC ACTGGCTATG TAGCTCCAGA TGAAATGTTT 850 

GTTGGTTATA CTCGTAAGTA TTCTATGGCT GTATGGACAG GTTATTCGAA 900 

TCGTTTAACT CCTATCGTTG GAGATGGTTT CCTAGTTGCA GCTAAAGTTT 950 

60 ATCGCTCAAT GATAACGTAT CTATCAAAAG ATACTCATCC AGAAGACTGG 1000 

566 
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ACGATGCCAG ACGGACTTTT CAGAAACGGG 
AGCTCGTTCT ACGTGGAACT CACCTGCTCC 
AAAGTTCAAG CTCATCATCA GATAGTTCAA 
ACTCCAAGCA CAAATAATAG TACGACTACC 
5 ACAATCAAAT ACAACCCCTG ATCAACAAA 



PCT/CA00/01150 
GAATTTGTAT TCAAAAATGG 1050 
ACAACAACCC CCATCAACTG 1100 
CTTCACAGTC TAACTCAACC 1150 
AATCCTAACA ATAATACGCA 12 00 

1229 



2 ) INFORMATION FOR SEQ ID NO: 1018 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1225 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1018 

25 ATGTAGACCA AGAGGCTCAA AAACGTCTGT GGGATATCTA CAACTCCGAT 50 

CAATACGTCT CTTACCCTGA CGATGATTTG CAAGTCGCAT CTACGGTCGT 100 

AGATGTTTCA AATGGTAAAG TCATCGCCCA ACTTGGAGCT CGTCACCAAG 150 

CAAGTAACGT TTCATTTGGT ACCAACCAAG CTGTGGAAAC CAACCGTGAC 200 

TGGGGATCAA GCATGAAACC AATCACTGAT TATGCCCCAG CCTTAGAATA 250 

30 TGGTGTATAT GATTCCACTG CAACTATGGT TAATGATATT CCTTATAACT 3 00 

ATCCGGGCAC AAGCACACCT GTCTACAACT GGGATCGAGC ATATTTTGGT 3 50 

AATATTAGCC TGCAATATGC CCTTCAACAA TCTCGTAACG TGCCTGCCGT 400 

TGAAACACTA AACAAGGTTG GTTTAGATAG AGCCAAAACT TTCCTAAATG 4 50 

GTTTGGGAAT TGACTATCCA AGTATTCACT ACTCAAATGC TATTTCAAGT 500 

35 AATACAACTG AATCTAGTAA ACAGTACGGG GCAAGCAGTG AGAAAATGGC 550 

TGCGGCTTAC GCTGCATTCG CTAATGGCGG TATTTACCAC AAACCAATGT 600 

ACATCAATAA AGTTGTCTTT AGTGATGGTA GCGAAAAAGA ATTTTCTGAT 650 

GCTGGTACAC GAGCTATGAA AGAGACTACT GCCTATATGA TGACTGAAAT 700 

GATGAAAACT GTTTTAACTT ACGGAACAGG ACGTGGAGCC TACCTACCAT 750 

40 GGCTTCCACA AGCAGGTAAG ACAGGTACTT CTAACTATAC TGACGACGAA 800 

ATTGAAAAGT ATATCAAGAA CACTGGCTAC GTAGCTCCAG ATGAAATGTT 850 

TGTGGGTTAT ACTCGTAAGT ATTCTATGGC TGTATGGACT GGATACTCAA 900 

ATCGTTTAAC TCCAATCATT GGAGATGGTT TCCTAGTTGC TGCCAAAGTT 950 

TATCGCTCAA TGATATCGTA TCTATCAGAA GATGACCATC CTGGAGATTG 1000 

45 GACAATGCCT GAGGGAGTAT ACAGAAGTGG AGAATTCGTA TTTAAAAATG 1050 

GTGCTCGTTC TACGTGGAGC TCCCCTGCTC CACAACAACC CCCATCAACT 1100 

GAAAGTTCAA GCTCATCATC AGATAGTTCA ACTTCACAGT CTAGCTTAAC 1150 

CACTCCAAGC ACAAATAATA GTACGACTAC CAATCCTAAC AACAATACGC 12 00 

AACAATCAAA TACAACCCCT GATCA 1225 

50 



2) INFORMATION- FOR SEQ ID NO: 1019 

55 - (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



567 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(ii) MOLECULE TYPE: Genomic DNA 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Streptococcus pneumoniae 
5 (B) STRAIN: StrR-01 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1019 

GCCTCTATTT CAAAGGAGAT GCCTGGCATT AGTATTTCTA CTTCTTGGGA 50 

10 TCGAAAGGTT TTGGAAACTT CCCTTTCTTC TATAGTAGGG AGTGTATCCA 100 

GTGAAAAAGC TGGTCTCCCA GCGGAAGAAG CAGAATCCTA TCTTAAAAAA 150 

GGCTATTCTC TAAATGACCG TGTTGGAACC TCCTATTTGG AAAAGCAATA 2 00 

TGAAGAGACC TTACAAGGAA AACGCTCGGT AAAAGAAATC CATCTGGATA 250 

AATATGGCAA TATGGAAAGC GTGGACACAA TTGAGGAAGG TAGTAAGGGA 300 

15 AACAATATCA AACTGACCAT TGATTTGGCC TTCCAAGATA GCGTGGATGC 350 

TTTGCTGAAA AGTTATTTCA ATTCCGAGCT AGGAAATGGT GGAGCTAAAT 400 

ATTCTGAAGG TGTCTATGCA GTCGCCCTTA ACCCAAAAAC AGGTGCTGTT 450 

TTATCCATGT CAGGGATCAA ACATGACCTG AAAACGGGAG AGTTGACTCC 500 

TGATTCCTTG GGAACGGTAA CCAATGTCTT TGTCCCAGGT TCGGTTGTCA 550 

20 AGGCTGCGAC CATCAGCTCA GGTTGGGAAA ATGGTGTTTT ATCAGGAAAC 600 

CAAACCTTAA CAGATCAGCC TATTGTTTTC CAAGGTTCAG CTCCAATTTA 650 

TTCTTGGTAT AAATTGGCAT ATGGATCTTT TC C TATTACA GCTGTGGAAG 700 

CCTTGGAGTA TTCATCCAAT GCTTACATGG TTCAAACCGC TCTTGGAATC 750 

ATGGGCCAGA CCTATCAACC AAATATG TTT GTTGGAACCA GCAATTTGGA 800 

25 AACAGCTATG GGAAAACTTC GTGCGACCTT TGGCGAATAT GGCTTGGGGG 850 

CTGCGACCGG AATTGACCTA CCAGATGAAT CTACTGGATT TGTTCCCAAA 900 

GAGTATAGCT TTGCTAATTA CATCACCAAT TCCTTTGGGC AGTTTGATAA 950 

CTATACGCCC ATGCAGTTGG CTCAGTATGT AGCAACTATT GCAAATAATG 1000 

GTGTTCGTGT GGCTCCTCGT ATTGTTGAAG GCATTTATGG TAATAATGAT 1050 

3 0 AAGGGAGGAC TGGGTGACTT GATTCAGCAA CTGCAACCGA CAGAGATGAA 1100 

TAAGGTCAAT ATATCCGACT CCGATATGAG CATCTTGCAC CAAGGTTTTT 1150 

ATCAGGTTGC CCATGGTACT AGTGGATTGA CAACTGGACG TGCCTTTTCA 1200 

AATGGTGCCT TGGTATCCAT TAGCGGAAAA ACAGGTACAG CCGAAAGCTA 1250 

TGTGGCAGAT GGTCAGCAAG CAACCAATAC CAATGCGGTG GCCTATGCCC 13 00 

35 CATCTGATAA TCCCCAAATC GCTGTCGCAG TGGTCTTTCC TCATAATACC 1350 

AATCTAACAA ATGGTGTAGG ACCTTCCATT GCGCGTGACA TTATCAATCT 14 00 

GTATCAAAAA TACCATCCAA TGAACTAGAA AGGAAATTA 143 9 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1020 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1441 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-02 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102 0 

GCCTCTATTT CAAAGGAGAT GCCTGGCATT AGTATTTCTA CTTCTTGGGA 50 

TCGAAAGGTT TTGGAAACTT CCCTTTCTTC TATAGTAGGG AGTGTATCCA 100 

GTGAAAAAGC TGGTCTCCCA GCGGAAGAAG CAGAATCCTA TCTTAAAAAA 150 

60 GGCTATTCTC TAAATGACCG TGTTGGAACC TCCTATTTGG AAAAGCAATA 200 

568 
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TGAAGAGACC TTACAAGGAA AACGCTCGGT AAAAGAAATC CATCTGGATA 250 

AATATGGCAA TATGGAAAGC GTGGACACAA TTGAGGAAGG TAGTAAGGGA 3 00 

AACAATATCA AACTGACCAT TGATTTGGCC TTCCAAGATA GCGTGGATGC 350 

TTTGCTGAAA AGTTATTTCA ATTCCGAGCT AGGAAATGGT GGAGCTAAAT 4 00 

5 ATTCTGAAGG TGTCTATGCA GTCGCCCTTA ACCCAAAAAC AGGTGCTGTT 450 

TTATCCATGT CAGGGATCAA ACATGACCTG AAAACGGGAG AGTTGACTCC 500 

TGATTCCTTG GGAACGGTAA CCAATGTCTT TGTCCCAGGT TCGGTTGTCA 550 

AGGCTGCGAC CATCAGCTCA GGTTGGGAAA ATGGTGTTTT ATCAGGAAAC 600 

CAAACCTTAA CAGATCAGCC TATTGTTTTC CAAGGTTCAG CTCCAATTTA 650 

10 TTCTTGGTAT AAATTGGCAT ATGGATCTTT TCCTATTACA GCTGTGGAAG 700 

CCTTGGAGTA TTCATCCAAT GCTTACATGG TTCAAACCGC TCTTGGAATC 750 

ATGGGCCAGA CCTATCAACC AAATATGTTT GTTGGAACCA GCAATTTGGA 800 

AACAGCTATG GGAAAACTTC GTGCGACCTT TGGCGAATAT GGCTTGGGGG 850 

CTGCGACCGG AATTGACCTA CCAGATGAAT CTACTGGATT TGTTCCCAAA 900 

15 GAGTATAGCT TTGCTAATTA CATCACCAAT TCCTTTGGGC AGTTTGATAA 950 

CTATACGCCC ATGCAGTTGG CTCAGTATGT AGCAACTATT GCAAATAATG 1000 

GTGTTCGTGT GGCTCCTCGT ATTGTTGAAG GCATTTATGG TAATAATGAT 1050 

AAGGGAGGAC TGGGTGACTT GATTCAGCAA CTGCAACCGA CAGAGATGAA 1100 

TAAGGTCAAT ATATCCGACT CCGATATGAG CATCTTGCAC CAAGGTTTTT 1150 

20 ATCAGGTTGC CCATGGTACT AGTGGATTGA CAACTGGACG TGCCTTTTCA 1200 

AATGGTGCCT TGGTATCCAT TAGCGGAAAA ACAGGTACAG CCGAAAGCTA 1250 

TGTGGCAGAT GGTCAGCAAG CAACCAATAC CAATGCGGTG GCCTATGCCC 13 00 

CATCTGATAA TCCCCAAATC GCTGTCGCAG TGGTCTTTCC TCATAATACC 1350 

AATCTAACAA ATGGTGTAGG ACCTTCCATT GCGCGTGACA TTATCAATCT 14 00 

25 GTATCAAAAA TACCATCCAA TGAAC TAGAA AGGAAATTAT G 1441 



2) INFORMATION FOR SEQ ID NO: 1021 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1396 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-03 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1021 

4 5 TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACTTCTT 50 

GGGATCGAAA GGTTTTGGAA ACTTCCCTTT CTTCTATAGT AGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAT CCTATCTTAA 150 

AAAAGGCTAT TCTCTAAATG ACCGTGTTGG AACCTCCTAT TTGGAAAAGC 2 00 

AATATGAAGA GACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 250 

50 GATAAATATG GCAATATGGA AAGCGTGGAC ACAATTGAGG AAGGTAGTAA 3 00 

GGGAAACAAT ATCAAACTGA CCATTGATTT GGCCTTCCAA GATAGCGTGG 350 

ATGCTTTGCT GAAAAGTTAT TTCAATTCCG AGCTAGGAAA TGGTGGAGCT 4 00 

AAATATTCTG AAGGTGTCTA TGCAGTCGCC CTTAACCCAA AAACAGGTGC 450 

TGTTTTATCC ATGTCAGGGA TCAAACATGA CCTGAAAACG GGAGAGTTGA 500 

55 CTCCTGATTC CTTGGGAACG GTAACCAATG TCTTTGTCCC AGGTTCGGTT 550 

GTCAAGGCTG CGACCATCAG CTCAGGTTGG GAAAATGGTG TTTTATCAGG 600 

AAACCAAACC TTAACAGATC AGCCTATTGT TTTCCAAGGT TCAGCTCCAA 650 

TTTATTCTTG GTATAAATTG GCATATGGAT CTTTTCCTAT TACAGCTGTG 700 

GAAGCCTTGG AGTATTCATC CAATGCTTAC ATGGTTCAAA CCGCTCTTGG 750 

60 AATCATGGGC CAGACCTATC AACCAAATAT GTTTGTTGGA ACCAGCAATT 800 

569 
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TGGAAACAGC TATGGGAAAA CTTCGTGCGA 
GGGGCTGCGA CCGGAATTGA CCTACCAGAT 
CAAAGAGTAT AGCTTTGCTA ATTACATCAC 
ATAACTATAC GCCCATGCAG TTGGCTCAGT 
5 AATGGTGTTC GTGTGGCTCC TCGTATTGTT 
TGATAAGGGA GGACTGGGTG ACTTGATTCA 
TGAATAAGGT CAATATATCC GACTCCGATA 
TTTTATCAGG TTGCCCATGG TACTAGTGGA 
TTCAAATGGT GCCTTGGTAT CCATTAGCGG 
10 GCTATGTGGC AGATGGTCAG CAAGCAACCA 
GCCCCATCTG ATAATCCCCA AATCGCTGTC 
TACCAATCTA ACAAATGGTG TAGGACCTTC 



PCT/CAOO/01150 



CCTTTGGCGA 


ATATGGCTTG 


850 


GAATCTACTG 


GATTTGTTCC 


900 


CAATTCCTTT 


GGGCAGTTTG 


950 


ATGTAGCAAC 


TATTGCAAAT 


1000 


GAAGGCATTT 


ATGGTAATAA 


1050 


GCAACTGCAA 


CCGACAGAGA 


1100 


TGAGCATCTT 


GCACCAAGGT 


1150 


TTGACAACTG 


GACGTGCCTT 


1200 


AAAAACAGGT 


ACAGCCGAAA 


1250 


ATACCAATGC 


GGTGGCCTAT 


1300 


GCAGTGGTCT 


TTCCTCATAA 


1350 


CATTGCGCGT 


GACATT 


1396 



15 



2) INFORMATION FOR SEQ ID NO: 1022 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1428 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-04 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1022 

AAAGGAGATG CCTGGCATTA GTATTTCTAC TTCTTGGGAT CGAAAGGTTT 50 

TGGAAACTTC CCTTTCTTCT ATAGTAGGGA GTGTATCCAG TGAAAAAGCT 100 

GGTCTCCCAG CGGAAGAAGC AGAATCCTAT CTTAAAAAAG GCTATTCTCT 150 

35 AAATGACCGT GTTGGAACCT CCTATTTGGA AAAGCAATAT GAAGAGACCT 200 

TACAAGGAAA ACGCTCGGTA AAAGAAATCC ATCTGGATAA ATATGGCAAT 250 

ATGGAAAGCG TGGACACAAT TGAGGAAGGT AGTAAGGGAA ACAATATCAA 3 00 

ACTGACCATT GATTTGGCCT TCCAAGATAG CGTGGATGCT TTGCTGAAAA 3 50 

GTTATTTCAA TTCCGAGCTA GGAAATGGTG GAGCTAAATA TTCTGAAGGT 400 

40 GTCTATGCAG TCGCCCTTAA CCCAAAAACA GGTGCTGTTT TATCCATGTC 450 

AGGGATCAAA CATGACCTGA AAACGGGAGA GTTGACTCCT GATTCCTTGG 500 

GAACGGTAAC CAATGTCTTT GTCCCAGGTT CGGTTGTCAA GGCTGCGACC 550 

ATCAGCTCAG GTTGGGAAAA TGGTGTTTTA TCAGGAAACC AAACCTTAAC 600 

AGATCAGCCT ATTGTTTTCC AAGGTTCAGC TCCAATTTAT TCTTGGTATA 650 

45 AATTGGCATA TGGATCTTTT CCTATTACAG CTGTGGAAGC CTTGGAGTAT 700 

TCATCCAATG CTTACATGGT TCAAACCGCT CTTGGAATCA TGGGCCAGAC 750 

CTATCAACCA AATATGTTTG TTGGAACCAG CAATTTGGAA ACAGCTATGG 800 

GAAAACTTCG TGCGACCTTT GGCGAATATG GCTTGGGGGC TGCGACCGGA 850 

ATTGACCTAC CAGATGAATC TACTGGATTT GTTCCCAAAG AGTATAGCTT 900 

50 TGCTAATTAC ATCACCAATT CCTTTGGGCA GTTTGATAAC TATACGCCCA 950 

TGCAGTTGGC TCAGTATGTA GCAACTATTG CAAATAATGG TGTTCGTGTG 1000 

GCTCCTCGTA TTGTTGAAGG CATTTATGGT AATAATGATA AGGGAGGACT 1050 

GGGTGACTTG ATTCAGCAAC TGCAACCGAC AGAGATGAAT AAGGTCAATA 1100 

TATCCGACTC CGATATGAGC ATCTTGCACC AAGGTTTTTA TCAGGTTGCC 1150 

55 CATGGTACTA GTGGATTGAC AACTGGACGT GCCTTTTCAA ATGGTGCCTT 12 00 

GGTATCCATT AGCGGAAAAA CAGGTACAGC CGAAAGCTAT GTGGCAGATG 12 50 

GTCAGCAAGC AACCAATACC AATGCGGTGG CCTATGCCCC ATCTGATAAT 13 0 0 

CCCCAAATCG CTGTCGCAGT GGTCTTTCCT CATAATACCA ATCTAACAAA 13 50 

TGGTGTAGGA CCTTCCATTG CGCGTGACAT TATCAATCTG TATCAAAAAT 1400 

60 ACCATCCAAT GAACTAGAAA GGAAATTA 1428 

570 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



2) INFORMATION FOR SEQ ID NO: 1023 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1442 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1023 

TGCCTCTATT TCAAAGGAGA TGCCTGGCAT TAGTATTTCT ACTTCTTGGG 50 

ATCGAAAGGT TTTGGAAACT TCCCTTTCTT CTATAGTAGG GAGTGTATCC 100 

AGTGAAAAAG CTGGTCTCCC AGCGGAAGAA GCAGAATCCT ATCTTAAAAA 150 

AGGCTATTCT CTAAATGACC GTGTTGGAAC CTCCTATTTG GAAAAGCAAT 200 

ATGAAGAGAC CTTACAAGGA AAACGCTCGG TAAAAGAAAT CCATCTGGAT 250 

AAATATGGCA ATATGGAAAG CGTGGACACA ATTGAGGAAG GTAGTAAGGG 300 

AAACAATATC AAACTGACCA TTGATTTGGC CTTCCAAGAT AGCGTGGATG 350 

CTTTGCTGAA AAGTTATTTC AATTCCGAGC TAGGAAATGG TGGAGCTAAA 400 

TATTCTGAAG GTGTCTATGC AGTCGCCCTT AACCCAAAAA CAGGTGCTGT 450 

TTTATCCATG TCAGGGATCA AACATGACCT GAAAACGGGA GAGTTGACTC 500 

CTGATTCCTT GGGAACGGTA ACCAATGTCT TTGTCCCAGG TTCGGTTGTC 550 

AAGGCTGCGA CCATCAGCTC AGGTTGGGAA AATGGTGTTT TATCAGGAAA 600 

CCAAACCTTA ACAGATCAGC CTATTGTTTT CCAAGGTTCA GCTCCAATTT 650 

ATTCTTGGTA TAAATTGGCA TATGGATCTT TTCCTATTAC AGCTGTGGAA 700 

GCCTTGGAGT ATTC AT CCAA TGCTTACATG GTTCAAACCG CTCTTGGAAT 750 

CATGGGCCAG ACCTATCAAC CAAATATGTT TGTTGGAACC AGCAATTTGG 800 

AAACAGCTAT GGGAAAACTT CGTGCGACCT TTGGCGAATA TGGCTTGGGG 850 

GCTGCGACCG GAATTGACCT ACCAGATGAA TCTACTGGAT TTGTTCCCAA 900 

AGAGTATAGC TTTGCTAATT ACATCACCAA TTCCTTTGGG CAGTTTGATA 950 

ACTATACGCC CATGCAGTTG GCTCAGTATG TAGCAACTAT TGCAAATAAT 1000 

GGTGTTCGTG TGGCTCCTCG TATTGTTGAA GGCATTTATG GTAATAATGA 1050 

TAAGGGAGGA CTGGGTGACT TGATTCAGCA ACTGCAACCG ACAGAGATGA 1100 

ATAAGGTCAA TATATCCGAC TCCGATATGA GCATCTTGCA CCAAGGTTTT 1150 

TATCAGGTTG CCCATGGTAC TAGTGGATTG ACAACTGGAC GTGCCTTTTC 12 00 

AAATGGTGCC TTGGTATCCA TTAGCGGAAA AACAGGTACA GCCGAAAGCT 1250 

ATGTGGCAGA TGGTCAGCAA GCAACCAATA CCAATGCGGT GGCCTATGCC 13 00 

CCATCTGATA ATCCCCAAAT CGCTGTCGCA GTGGTCTTTC CTCATAATAC 1350 

CAATCTAACA AATGGTGTAG GACCTTCCAT TGCGCGTGAC ATTATCAATC 1400 

TGTATCAAAA ATACCATCCA ATGAACTAGA AAGGAAATTA TG 1442 



2) INFORMATION FOR SEQ ID NO: 1024 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1445 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

571 
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5 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1024 



TTGCCTCTAT TTCAAAGGAG ATGCCTGGCA TTAGTATTTC TACTTCTTGG 50 

GATCGAAAGG TTTTGGAAAC TTCCCTTTCT TCTATAGTAG GGAGTGTATC 100 

10 CAGTGAAAAA GCTGGTCTCC CAGCGGAAGA AGCAGAATCC TATCTTAAAA 150 

AAGGCTATTC TCTAAATGAC CGTGTTGGAA CCTCCTATTT GGAAAAGCAA 2 00 

TATGAAGAGA CCTTACAAGG AAAACGCTCG GTAAAAGAAA TCCATCTGGA 250 

TAAATATGGC AATATGGAAA GCGTGGACAC AATTGAGGAA GGTAGTAAGG 3 00 

GAAACAATAT CAAACTGACC ATTGATTTGG CCTTCCAAGA TAGCGTGGAT 350 

15 GCTTTGCTGA AAAGTTATTT CAATTCCGAG CTAGGAAATG GTGGAGCTAA 400 

ATATTCTGAA GGTGTCTATG CAGTCGCCCT TAACCCAAAA ACAGGTGCTG 450 

TTTTATCCAT GTCAGGGATC AAACATGACC TGAAAACGGG AGAGTTGACT 500 

CCTGATTCCT TGGGAACGGT AACCAATGTC TTTGTCCCAG GTTCGGTTGT 550 

CAAGGCTGCG ACCATCAGCT CAGGTTGGGA AAATGGTGTT TTATCAGGAA 600 

20 ACCAAACCTT AACAGATCAG CCTATTGTTT TCCAAGGTTC AGCTCCAATT 650 

TATTCTTGGT ATAAATTGGC ATATGGATCT TTTCCTATTA CAGCTGTGGA 700 

AGCCTTGGAG TATTCATCCA ATGCTTACAT GGTTCAAACC GCTCTTGGAA 750 

TCATGGGCCA GACCTATCAA CCAAATATGT TTGTTGGAAC CAGCAATTTG 800 

GAAACAGCTA TGGGAAAACT TCGTGCGACC TTTGGCGAAT ATGGCTTGGG 850 

25 GGCTGCGACC GGAATTGACC TACCAGATGA ATCTACTGGA TTTGTTCCCA 900 

AAGAGTATAG CTTTGCTAAT TACATCACCA ATTCCTTTGG GCAGTTTGAT 950 

AACTATACAC CCATGCAGTT GGCTCAGTAT GTAGCAACTA TTGCAAATAA 1000 

TGGTGTTCGT GTGGCTCCTC GTATTGTTGA AGGCATTTAT GGTAATAATG 1050 

ATAAGGGAGG ACTGGGTGAC TTGATTCAGC AACTGCAACC GACAGAGATG 1100 

30 AATAAGGTCA ATATATCCGA CTCCGATATG AGCATCTTGC ACCAAGGTTT 1150 

TTATCAGGTT GCCCATGGTA CTAGTGGATT GACAACTGGA CGTGCCTTTT 12 00 

CAAATGGTGC CTTGGTATCC ATTAGCGGAA AAACAGGTAC AGCCGAAAGC 1250 

TATGTGGCAG ATGGTCAGCA AGCAACCAAT ACCAATGCGG TGGCCTATGC 1300 

CCCATCTGAT AATCCCCAAA TCGCTGTCGC AGTGGTCTTT CCTCATAATA 1350 

35 CCAATCTAAC AAATGGTGTA GGACCTTCCA TTGCGCGTGA CATTATCAAT 1400 

CTGTAT CAAA AATACCATCC AATGAACTAG AAAGGAAATT ATGCT 1445 



40 2) INFORMATION FOR SEQ ID NO: 1025 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1441 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-07 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1025 



TTGCCTCTAT TTCAAAGGAG ATGCCTGGCA TTAGTATTTC TACTTCTTGG 50 

GATCGAAAGA TTTTGGAAAC TTCCCTTTCT TCTATAGTAG GGAGTGTATC 100 

CAGTGAAAAA GCTGGTCTCC CAGCGGAAGA AGCAGAATCC TATCTTAAAA 150 

AAGGCTATTC TCTAAATGAC CGTGTTGGAA CCTCGTATTT GGAAAAGCAA 20 0 

60 TATGAAGAGA CCTTACAAGG AAAACGCTCG GTAAAAGAAA TCCATCTGGA 250 

572 
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TAAATATGGC AATATGGAAA GCGTGGACAC AATTGAGGAA GGTAGTAAGG 3 00 

GAAACAATAT CAAACTGACC ATTGATTTGG CCTTCCAAGA TAGCGTGGAT 3 50 

GCTTTGCTGA AAAGTTATTT CAATTCCGAG CTAGGAAATG GTGGAGCCAA 400 

GTATTCTGAG GGTGTGTATG CAGTCGCCCT TAACCCCAAA ACAGGTGCTG 450 

5 TTTTGTCTAT GTCAGGACTC AAACATGACC TGAAAACGGG AGAGTTGACT 500 

CCTGATTCCT TGGGAACGGT AACCAATGTC TTTGTCCCAG GTTCGGTTGT 550 

CAAGGCTGCG ACCATCAGCT CTGGCTGGGA AAATGGTGTT TTATCAGGAA 600 

ACCAAACCTT AACAGATCAG CCTATTGTTT TCCAAGGTTC AGCTCCAATT 650 

TATTCTTGGT ATAAATTGGC ATATGGATCT TTTCCTATTA CAGCTGTGGA 700 

10 AGCCTTGGAG TATTCATCCA ATGCTTACAT GGTTCAAACC GCTCTTGGAA 750 

TCATGGGCCA GACCTATCAA CCAAATATGT TTGTTGGAAC CAGCAATTTG 800 

GAAACAGCTA TGGGAAAACT TCGTGCGACC TTTGGCGAAT ATGGCTTGGG 850 

GGCTGCGACC GGAATTGACC TACCAGATGA ATCTACTGGA TTTGTTCCCA 900 

AAGACTATAG CTTTGCTAAT TACATCACCA ATGCCTTTGG GCAGTTTGAT 950 

15 AACTATACGC CCATGCAGTT GGCTCAGTAT GTAGCAACTA TTGCAAATGA 1000 

TGGTGTTCGT GTGGCTCCTC GTATTGTTGA AGGCATTTAT GGTAATAATG 1050 

ATAAGGGAGG ACTGGGTGAC TTGATTCAGC AACTGCAACC GACAGAGATG 1100 

AATAAGGTCA ATATATCCGA CTCCGATATG AGTATCTTGC ACCAAGGATT 1150 

TTACCAAGTA TGGCATGGAA CTAGTCCCCT TACGACAGGA CGGGCGTTTT 12 00 

20 CAGATGGCGC CACTGTTTCT ATCAGTGGTA AGACCGGTAC AGGTGAAAGC 1250 

TATGTAGCTG GTGGTCAAGA AGCTAATAAT ACCAATGCCG TGGCCTATGC 1300 

TCCAACAGAA AATCCTCAAA TTGCAGTTGC AGTAGTCTTT CCTCATAATA 1350 

CCAATTTAAC CAAAAATGTT GGGCCAGCAA TTGCTCGCGA CATTATCAAT 1400 

TTATATAACC AACACCATCC AATGAATTAG AAAGGAAGCC A 1441 

25 



2) INFORMATION FOR SEQ ID NO: 1026 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1443 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
40 (B) STRAIN: StrR-08 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1026 

TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACTTCTT 50 

45 GGGATCGAAA GGTTTTGGAA ACTTCCCTTT CTTCTATAGT AGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAT CCTATCTTAA 150 

AAAAGGCTAT TCTCTAAATG ACCGTGTTGG AACCTCCTAT TTGGAAAAGC 2 00 

AATATGAAGA GACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 2 50 

GATAAATATG GCAATATGGA AAGCGTGGAC ACAATTGAGG AAGGTAGTAA 3 00 

50 GGGAAACAAT ATCAAACTGA CCATTGATTT GGCCTTCCAA GATAGCGTGG 350 

ATGCTTTGCT GAAAAGTTAT TTCAATTCCG AGCTAGGAAA TGGTGGAGCT 400 

AAATATTCTG AAGGTGTCTA TGCAGTCGCC CTTAACCCAA AAACAGGTGC 450 

TGTTTTATCC ATGTCAGGGA TCAAACATGA CCTGAAAACG GGAGAGTTGA 500 

CTCCTGATTC CTTGGGAACG GTAACCAATG TCTTTGTCCC AGGTTCGGTT 550 

55 GTCAAGGCTG CGACCATCAG CTCAGGTTGG GAAAATGGTG TTTTATCAGG 600 

AAACCAAACC TTAACAGATC AGCCTATTGT TTTCCAAGGT TCAGCTCCAA 650 

TTTATTCTTG GTATAAATTG GCATATGGAT CTTTTCCTAT TACAGCTGTG 700 

GAAGCCTTGG AGTATTCATC CAATGCTTAC ATGGTTCAAA CCGCTCTTGG 750 

AATCATGGGC CAGACCTATC AACCAAATAT GTTTGTTGGA ACCAGCAATT 800 

60 TGGAAACAGC TATGGGAAAA CTTCGTGCGA CCTTTGGCGA ATATGGCTTG 850 
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GGGGCTGCGA CCGGAATTGA CCTACCAGAT GAATCTACTG GATTTGTTCC 900 

CAAAGAGTAT AGCTTTGCTA ATTACATCAC CAATTCCTTT GGGCAGTTTG 950 

ATAACTATAC GCCCATGCAG TTGGCTCAGT ATGTAGCAAC TATTGCAAAT 1000 

AATGGTGTTC GTGTGGCTCC TCGTATTGTT GAAGGCATTT ATGGTAATAA 1050 

5 TGATAAGGGA GGACTGGGTG ACTTGATTCA GCAACTGCAA CCGACAGAGA 1100 

TGAATAAGGT CAATATATCC GACTCCGATA TGAGCATCTT GCACCAAGGT 1150 

TTTTATCAGG TTGCCCATGG TACTAGTGGA TTGACAACTG GACGTGCCTT 1200 

TTCAAATGGT GCCTTGGTAT CCATTAGCGG AAAAACAGGT ACAGCCGAAA 1250 

GCTATGTGGC AGATGGTCAG CAAGCAACCA ATACCAATGC GGTGGCCTAT 1300 

10 GCCCCATCTG ATAATCCCCA AATCGCTGTC GCAGTGGTCT TTCCTCATAA 1350 

TACCAATCTA ACAAATGGTG TAGGACCTTC CATTGCGCGT GACATTATCA 1400 

ATCTGTATCA AAAATACCAT CCAATGAACT AGAAAGGAAA TTA 1443 



15 



25 



2) INFORMATION FOR SEQ ID NO: 1027 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1443 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-09 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1027 

TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACTTCTT 50 

GGGATAGAAA GGTTTTGGAA ACTTCCCTTT CTTCTATAGT TGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAG CCTATCTTAA 150 

3 5 AAAAGGCTAT TCTCTAAATG ATCGTGTAGG AACCTCCTAT TTGGAAAAGC 2 00 

AATATGAAGA GACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 250 

GATAAATATG GCAATATGGA AAGCGTGGAT ACAATTGAGG AAGGTAGTAA 3 00 

GGGAAACAAT ATCAAACTGA CCATTGATTT GGCTTTCCAA GATAGCGTGG 3 50 

ATGCTTTACT GAAAAGTTAT TTCAATTCCG AGCTAGAAAA TGGTGGAGCC 400 

40 AAGTATTCTG AAGGTGTCTA TGCAGTCGCC CTTAACCCAA AAACAGGTGC 4 50 

TGTTTTGTCT ATGTCAGGGA TTAAACATGA CTTGAAAACG GGAGATTTAA 500 

CACCTGATTC CTTGGGAACA GTAACCAATG TCTTTGTCCC GGGTTCTGTT 550 

GTCAAGGCGG CGACCATCAG CTCTGGTTGG GAGAATGGAG TCTTATCAGG 600 

AAATCAGACC TTGACAGACC AACCGATTGT CTTCCAAGGT TCAGCTCCGA 650 

45 TTAATTCTTG GTACACTCAG GCTTACGATT CATTTCCGAT TACAGCGGTG 700 

GAAGCCTTGG AGTATTCTTC TAATGCCTAT ATGGTCCAAA CAGCTCTAGG 750 

TCTTATGGGG CAGACCTACC AACCCAATAT GTTTGTCGGC ACCAGCAATC 800 

TAGAGTCTGC TATGGGGAAA TTGCGTTCAA CCTTTGGTGA ATATGGCTTG 850 

GGCTCTGCGA CTGGGATTGA CCTACCAGAT GAATCTACTG GATTTGTTCC 900 

50 CAAAGAGTAT AGCTTTGCTA ATTACATTAC TAATGCCTTT GGGCAGTTTG 950 

ATAACTATAC GCCGATGCAG TTGGCTCAGT ATGTAGCAAC TATTGCAAAT 1000 

GATGGTGTTC GTGTGGCTCC TCGTATTGTG GAAGGCATTT ATGGCAATAA 1050 

TGATAAGGGA GGACTGGGTG ACTTGATTCA GCAACTGCAA CCGACAGAGA 1100 

TGAATAAGGT CAATATATCC GACTCCGATA TGAGTATCTT GCACCAAGGT 1150 

55 TTTTATCAGG TTGCTCATGG GACTAGCGGA TTGACAACAG GTCGTGCCTT 1200 

TTCCAATGGT GCAGCTGTAT CCATTAGTGG AAAAACAGGT ACCGCCGAAA 12*50 

GTTATGTAGC AGGTGGCCAA GAAGCCAACA ATACTAATGC TGTAGCCTAT 1300 

GCACCATCAG ATAATCCTCA AATAGCTGTT GCTGTTGTCT TCCCTCATAA 1350 

CACCAATCTA ACAAATGGTG TAGGACCTTC CATTGCGCGT GATATTATCA 14 00 

60 ATCTGTATCA AAAATACCAT CCAATGAACT AGAAAGGAAT TTA 1443 

574 
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2) INFORMATION FOR SEQ ID NO: 1028 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1446 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

<ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1028 

TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACCTCTT 50 

GGGATAGAAA GGTTTTGGAA ACTTCTCTTT CTTCTATAGT AGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAG CCTATCTTAA 150 

AAAAGGCTAT TCTCTAAATG ACCGTGTTGG GACTTCTTAC CTGGAAAAAC 200 

AATACGAGGA AACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 250 

GATAAATATG GCAATATGGA AAGCGTGGAT ACAATTGAGG AAGGTAGTAA 300 

GGGAAACAAT ATCAAACTGA CCATTGATTT GGCCTTCCAA GATAGCGTGG 350 

ATGCTTTGCT GAAAAGTTAT TTCAATTCCG AGCTAGGAAA TGGTGGAGCT 400 

AAGTATTCTG AAGGCGTGTA TGCAGTCGCC CTTAACCCCA AAACAGGTGC 450 

TGTTTTGTCT ATGTCAGGAC TCAAACATGA CCTGAAAACG GGAGACTTGA 500 

CGCCTGATTC CTTGGGAACG GTAACCAATG TCTTTGTCCC AGGGTCAGTA 550 

GTTAAGGCCG CTACCATCAG CTCAGGTTGG GAAAATGGTG TTTTATCAGG 600 

AAACCAAACC TTAACAGATC AGCCTATTGT TTTCCAAGGT TCAGCTCCAA 650 

TTTATTCTTG GTATAAATTG GCATATGGAT CTTTTCCTAT TACAGCTGTG 700 

GAAGCCTTGG AGTATTCATC TAATGCTTAC ATGGTTCAAA CCGCTCTTGG 750 

AATCATGGGC CAGACCTATC AACCAAATAT GTTTGTTGGA ACCAGCAATT 800 

TGGAAACAGC TATGGGAAAA CTTCGTGCGA CCTTTGGCGA ATATGGCTTG 850 

GGGGCTGCGA CCGGAATTGA CCTACCAGAT GAATCTACTG GATTTGTTCC 900 

CAAAGAGTAT AGCTTTGCTA ATTACATTAC TAATGCCTTT GGGCAGTTTG 950 

ATAACTATAC GCCGATGCAG TTGGCTCAGT ATGTAGCAAC TATTGCAAAT 100 0 

GATGGTGTTC GTGTGGCTCC TCGTATTGTT GAAGGCATTT ATGGTAATAA 1050 

TGATAAGGGA GGACTGGGTG ACTTGATTCA GCAACTGCAA CCGACAGAGA 1100 

TGAATAAGGT CAATATATCC GACTCCGATA TGAGCATCTT GCACCAAGGT 1150 

TTTTATCAGG TTGCCCATGG TACTAGTGGA TTGACAACTG GACGTGCCTT 1200 

TTCAAATGGC GCCTTGGTAT CCATTAGCGG AAAAACAGGT AC AGC CGAAA 1250 

GCTATGTGGC AGATGGTCAG CAAGCAACCA ATACCAATGC GGTGGCCTAT 1300 

GCCCCATCTG ATAATCCCCA AATCGCTGTT GCAGTGGTCT TTCCTCATAA 1350 

TACCAATCTA ACAAATGGTG TAGGACCTTC CATTGCGCGT GACATTATCA 1400 

ATCTGTATCA AAAATAC CAT CCAATGAACT AGAAAGGAAA TTATGC 144 6 



2) INFORMATION FOR SEQ ID NO: 1029 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1423 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

575 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1029 

TTGCCTCTAT TTCAAAGGAG ATGCCTGGCA TTAGTATTTC TACTTCTTGG 50 

GATAGAAAGG TTTTGGAAAC TTCCCTTTCT TCTATAGTTG GGAGTGTATC 10 0 

CAGTGAAAAA GCTGGTCTCC CAGCGGAAGA AGCAGAAGCC TATCTTAAAA 150 

AAGGCTATTC TCTAAATGAC CGTGTAGGAA CCTCCTATTT GGAAAAG CAA 200 

TATGAAGAGA CCTTACAAGG AAAACGCTCG GTAAAAGAAA TCCATCTGGA 250 

TAAATATGGC AATATGGAAA GCGTGGATAC AATTGAGGAA GGTAGTAAGG 300 

GAAACAATAT CAAACTGACC ATTGATTTGG CCTTCCAAGA TAGCGTGGAT 350 

GCTTTACTGA AAAGTTATTT CAATTCTGAG CTAGAAAATG GTGGAGCCAA 400 

GTATTCTGAA GGTGTCTATG CAGTCGCCCT TAACCCAAAA ACAGGTGCGG 450 

TTTTGTCTAT GTCAGGGATT AAACATGACT TGAAAACGGG AGAGTTGACG 500 

CCTGATTCCT TGGGAACGGT AACCAATGTC TTTGTTCCAG GTTCGGTTGT 550 

CAAGGCGGCG AC C ATCAGCT CAGGTTGGGA AAATGGAGTC TTGTCAGGAA 600 

ACCAGACCTT GACAGACCAG TCCATTGTCT TCCAAGGTTC AGCTCCCATC 650 

AATTCTTGGT ATACTCAGGC TTACGGTTCA TTCCCTATCA CAGCGGTCCA 700 

AGCTCTGGAG TATTCATCCA ATGCTTATAT GGTCCAAACA GCCTTAGGTC 750 

TTATGGGGCA GACCTATCAA CCCAATATGT TTGTCGGCAC CAGCAATCTA 800 

GAGTCTGCTA TGGGTAAATT GCGTTCAACC TTTGGCGAAT ATGGCTTGGG 850 

GGCTGCGACT GGGATTGATC TACCAGATGA ATCTACTGGA TTTGTTCCCA 900 

AAGACTATAA CTTTGCCAAT TTCATTACCA ATGCCTTTGG GCAGTTTGAT 950 

AACTATACCC CAATGCAATT GGCTCAGTAT GTAGCAACTA TTGCAAATGA 1000 

TGGTGTTCGT GTGGCTCCTC GTATTGTTGA AGGCATTTAT GGTAATAATG 1050 

ATAAGGGAGG ACTGGGTGAC TTGATTCAGC AACTGCAACC GACAGAGATG 1100 

AATAAGGTCA ATATATCCGA CTCCGATATG AGTGTCTTGC ACCAAGGTTT 1150 

TTATCAGGTT GCTCATGGGA CTAGTGGGTT GACAACTGGC CGTGCCTTTT 12 00 

CAAATGGTGC CTTGGTATCC ATTAGCGGAA AAACGGGTAC AGCCGAAAGC 12 50 

TATGTGGCAG ATGGTCAGGA AGCAACCAAT ACCAATGCGG TGGCCTATGC 13 00 

CCCATCTGAT AATCCCCAAA TCGCTGTCGC AGTGGTCTTT CCTCATAATA 13 50 

CCAATCTAAC AAATGGTGTA GGACCTTCCA TTGCGCGTGA CATTATCAAT 1400 

CTGTATCAAA AATACCATCC AAT 1423 



2) INFORMATION FOR SEQ ID NO: 1030 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 7 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-12 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103 0 

TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACTTCTT 50 

GGGATCGAAA GGTTTTGGAA ACTTCCCTTT CTTCTATAGT AGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAT CCTATCTTAA 150 

AAAAGGCTAT TCTCTAAATG ACCGTGTTGG AACCTCCTAT TTGGAAAAGC 2 00 

AATATGAAGA GACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 2 50 

576 
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GATAAATATG GCAATATGGA AAGCGTGGAC ACAATTGAGG AAGGTAGTAA 300 

GGGAAACAAT ATCAAACTGA CCATTGATTT GGCCTTCCAA GATAGCGTGG 350 

ATGCTTTGCT GAAAAGTTAT TTCAATTCCG AGCTAGGAAA TGGTGGAGCT 400 

AAATATTCTG AAGGTGTCTA TGCAGTCGCC CTTAACCCAA AAACAGGTGC 450 

5 TGTTTTATCC ATGTCAGGGA TCAAACATGA CCTGAAAACG GGAGAGTTGA 500 

CTCCTGATTC CTTGGGAACG GTAACCAATG TCTTTGTCCC AGGTTCGGTT 550 

GTCAAGGCTG CGACCATCAG CTCAGGTTGG GAAAATGGTG TTTTATCAGG 600 

AAACCAAACC TTAACAGATC AGCCTATTGT TTTCCAAGGT TCAGCTCCAA 650 

TTTATTCTTG GTATAAATTG GCATATGGAT CTTTTCCTAT TACAGCTGTG 700 

10 GAAGCCTTGG AGTATTCATC CAATGCTTAC ATGGTTCAAA CCGCTCTTGG 750 

AATCATGGGC CAGACCTATC AACCAAATAT GTTTGTTGGA ACCAGCAATT 800 

TGGAAACAGC TATGGGAAAA CTTCGTGCGA CCTTTGGCGA ATATGGCTTG 850 

GGGGCTGCGA CCGGAATTGA CCTACCAGAT GAATCTACTG GATTTGTTCC 900 

CAAAGAGTAT AGCTTTGCTA ATTACATCAC CAATTCCTTT GGGCAGTTTG 950 

15 ATAACTATAC GCCCATGCAG TTGGCTCAGT ATGTAGCAAC TATTGCAAAT 1000 

AATGGTGTTC GTGTGGCTCC TCGTATTGTT GAAGGCATTT ATGGTAATAA 1050 

TGATAAGGGA GGACTGGGTG ACTTGATTCA GCAACTGCAA CCGACAGAGA 1100 

TGAATAAGGT CAATATATCC GACTCCGATA TGAGCATCTT GCACCAAGGT 1150 

TTTTATCAGG TTGCCCATGG TACTAGTGGA TTGACAACTG GACGTGCCTT 1200 

20 TTCAAATGGT GCCTTGGTAT CCATTAGCGG AAAAACAGGT ACAGCCGAAA 1250 

GCTATGTGGC AGATGGTCAG CAAGCAACCA ATACCAATGC GGTGGCCTAT 1300 

GCCCCATCTG ATAATCCCCA AATCGCTGTC GCAGTGGTCT TTCCTCATAA 1350 

TACCAATCTA ACAAATGGTG TAGGACCTTC CATTGCGCGT GACATTATCA 1400 

ATCTGTATCA AAAATACCAT CCAATGAACT AGAAAGGAAA TTATGCT 144 7 

25 



2) INFORMATION FOR SEQ ID NO: 1031 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 6 bases 

(B) , TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
40 (B) STRAIN: StrR-13 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1031 

ATTGCCTCTA TTTCAAAGGA GATGCCTGGC ATTAGTATTT CTACCTCTTG 50 

4 5 GGATAGAAAG GTTTTGGAAA CTTCTCTTTC TTCTATAGTA GGGAGTGTAT 100 

CCAGTGAAAA AGCTGGTCTC CCAGCGGAAG AAGCAGAAGC CTATCTTAAA 150 

AAAGGCTATT CTCTAAATGA CCGTGTTGGG ACTTCTTACC TGGAAAAACA 2 00 

ATACGAGGAA ACCTTACAAG GAAAACGCTC GGTAAAAGAA ATCCATCTGG 250 

ATAAATATGG CAATATGGAA AGCGTGGATA CAATTGAGGA AGGTAGTAAG 300 

50 GGAAACAATA TCAAACTGAC CATTGATTTG GCCTTCCAAG ATAGCGTGGA 3 50 

TGCTTTGCTG AAAAGTTATT TCAATTCCGA GCTAGGAAAT GGTGGAGCTA 4 00 

AGTATTCTGA AGGCGTGTAT GCAGTCGCCC TTAACCCCAA AACAGGTGCT 4 50 

GTTTTGTCTA TGTCAGGACT CAAACATGAC CTGAAAACGG GAGACTTGAC 500 

GCCTGATTCC TTGGGAACGG TAACCAATGT CTTTGTCCCA GGGTCAGTAG 550 

55 TTAAGGCCGC TACCATCAGC TCAGGTTGGG AAAATGGTGT TTTATCAGGA 600 

AACCAAACCT TAACAGATCA GCCTATTGTT TTCCAAGGTT CAGCTCCAAT 650 

TTATTCTTGG TATAAATTGG CATATGGATC TTTTCCTATT ACAGCTGTGG 700 

AAGCCTTGGA GTATTCATCT AATGCTTACA TGGTTCAAAC CGCTCTTGGA 750 

ATCATGGGCC AGACCTATCA ACCAAATATG TTTGTTGGAA CCAGCAATTT 800 

60 GGAAACAGCT ATGGGAAAAC TTCGTGCGAC CTTTGGCGAA TATGGCTTGG 850 

577 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



GGGCTGCGAC CGGAATTGAC CTACCAGATG AATCTACTGG ATTTGTTCCC 900 

AAAGAGTATA GCTTTGCTAA TTACATTACT AATGCCTTTG GGCAGTTTGA 950 

TAACTATACG CCGATGCAGT TGGCTCAGTA TGTAGCAACT ATTGCAAATG 1000 

ATGGTGTTCG TGTGGCTCCT CGTATTGTTG AAGGCATTTA TGGTAATAAT 1050 

5 GATAAGGGAG GACTGGGTGA CTTGATTCAG CAACTGCAAC CGACAGAGAT 1100 

GAATAAGGTC AATATATCCG ACTCCGATAT GAGCATCTTG CACCAAGGTT 1150 

TTTATCAGGT TGCCCATGGT ACTAGTGGAT TGACAACTGG ACGTGCCTTT 1200 

TCAAATGGCG CCTTGGTATC CATTAGCGGA AAAACAGGTA CAGCCGAAAG 1250 

CTATGTGGCA GATGGTCAGC AAGCAACCAA TACCAATGCG GTGGCCTATG 1300 

10 CCCCATCTGA TAATCCCCAA ATCGCTGTTG CAGTGGTCTT TCCTCATAAT 1350 

ACCAATCTAA CAAATGGTGT AGGACCTTCC ATTGCGCGTG ACATTATCAA 1400 

TCTGTATCAA AAATACCATC CAATGAACTA GAAAGGAAAT TATGCT 1446 



15 



2) INFORMATION FOR SEQ ID NO: 1032 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1446 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-14 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1032 

ATTGCCTCTA TTTCAAAGGA GATGCCTGGC ATTAGTATTT CTACTTCTTG 50 

GGATAGAAAG GTTTTGGAAA CTTCCCTTTC TTCTATAGTT GGGAGTGTAT 100 

CCAGTGAAAA AGCTGGTCTC CCAGCGGAAG AAGCAGAAGC CTATCTTAAA 150 

35 AAAGGCTATT CTCTAAATGA TCGTGTTGGA ACCTCCTATT TGGAAAAGCA 2 00 

ATATGAAGAG ACCTTACAAG GGAAACGCTC GGTAAAAGAA ATCCATCTGG 250 

ATAAATATGG CAATATGGAA AGTGTGGATA CAATTGAGGA AGGTAGTAAG 3 00 

GGAAACAATA TCAAGCTGAC CATTGATTTG GCCTTCCAAG ATAGCGTGGA 350 

TGCTTTGCTG AAAAGTTATT TCAATTCCGA GCTAGGAAAT GGTGGAGCCA 4 00 

40 AGTATTCTGA GGGTGTGTAT GCAGTCGCCC TTAACCCCAA AACAGGTGCT 450 

GTTTTGTCTA TGTCAGGACT CAAACATGAC CTGAAAACGG GAGAGTTGAC 500 

TCCTGATTCC TTGGGAACGG TAACCAATGT CTTTGTCCCA GGTTCGGTTG 550 

TCAAGGCTGC GACCATCAGC TCTGGCTGGG AAAATGGAGT CTTATCAGGA 600 

AACCAGACCT TGACAGACCA GTCCATTGTC TTTCAAGGTT CAGCTCCCAT 650 

4 5 CAATTCTTGG TATACTCAGG CTTACGGTTC ATTCCCTATC ACAGCAGTCC 700 

AAGCTCTGGA GTATTCATCT AATGCCTATA TGGTCCAAAC AGCTTTAGGT 750 

CTTATGGGGC AGACCTACCA ACCTAATATG TTTGTCGGCA CCAGCAACCT 800 

AGAGTCTGCT ATGGGGAAAT TGCGTTCAAC CTTTGGTGAA TATGGTTTGG 850 

GTTCTGCGAC CGGGATTGAC CTACCAGATG AATCTACTGG ATTTGTTCCC 900 

50 AAAGACTATA GCTTTGCTAA TTACATCACC AATGCCTTTG GGCAGTTTGA 950 

TAACTATACG CCGATGCAGT TGGCTCAGTA TGTAGCAACT ATTGCAAATG 1000 

ATGGTGTTCG TGTGGCTCCT CGTATTGTTG AAGGCATTTA TGGAAATAAT 1050 

GATAAGGGAG GCCTAGGCGA CTTGATTCAG CAACTGCAAC CGACAGAGAT 1100 

GAATAAGGTC AATATATCCG ACTCTGATAT GAGTATTTTG CACCAAGGTT 1150 

55 TTTATCAGGT TGCTCATGGG ACTAGTGGAT TGACAACTGG ACGTGCCTTT 12 00 

TCAAATGGCG CAGCGGTATC CATTAGTGGA AAAACAGGTA CTGCCGAAAG 1250 

TTATGTTGAG GGTGGTCAAG AAGCTAACAA TACTAATGCT GTGGCCTATG 13 00 

CACCATCAGA TAATCCTCAA ATCGCTGTAG CTGTTGTCTT CCCTCATAAC 1350 

ACCAACCTTA CAAATGGTGT CGGACCTTCC ATTGCGCGCG ATATTATCAA 14 00 

60 CCTCTATAAC CAACATCATC CAATGAATTA GAAAGGAACA TATGCT 1446 

578 
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2) INFORMATION FOR SEQ ID NO: 1033 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1446 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1033 

TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACTTCTT 50 

GGGATAGAAA GGTTTTGGAA ACTTCCCTTT CTTCTATAGT TGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAG CCTATCTTAA 150 

AAAAGGCTAT TCTCTAAATG ACCGTGTTGG AACCTCGTAT TTGGAAAAGC 200 

AATATGAAGA GACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 250 

GATAAATATG GCAATATGGA AAGCGTGGAT ACAATTGAGG AAGGTAGTAA 3 00 

GGGAAACAAT ATCAAGCTGA CCATTGATTT GGCCTTCCAA GATAGCGTGG 350 

ATGCTTTGCT GAAAAGTTAT TTCAATTCCG AGCTAGGAAA TGGTGGAGCC 400 

AAGTATTCTG AGGGTGTGTA TGCAGTCGCC CTTAACCCCA AAACAGGTGC 450 

TGTTTTGTCT ATGTCAGGAC TCAAACATGA CCTGAAAACG GGAGAGTTGA 500 

CTCCTGATTC CTTGGGAACG GTAACCAATG TCTTTGTCCC AGGTTCGGTT 550 

GTTAAGGCCG CTACCATCAG CTCAGGTTGG GAAAATGGTG TTTTATCAGG 600 

AAACCAAACC TTAACAGATC AGCCTATTGT TTTCCAAGGT TCAGCTCCAA 650 

TTTATTCTTG GTATAAATTG GCATATGGAT CTTTTCCTAT TACAGCTGTG 700 

GAAGCCTTGG AGTATTCATC CAATGCTTAC ATGGTTCAAA CCGCTCTTGG 750 

AATCATGGGC CAGACCTATC AACCAAATAT GTTTGTTGGA ACCAGCAATT 800 

TGGAAACAGC TATGGGAAAA CTTCGTGCGA CCTTTGGCGA ATATGGCTTG 850 

GGGGCTGCGA CCGGAATTGA CCTACCAGAT GAATCTACTG GATTTGTTCC 900 

CAAAGAGTAT AGCTTTGCTA ATTACATCAC CAATGCCTTT GGGCAGTTTG 950 

ATAACTATAC GCCCATGCAG TTGGCTCAGT ATGTAGCAAC TATTGCAAAT 1000 

GATGGTGTTC GTGTGGCTCC TCGTATTGTT GAAGGCATTT ATGGTAATAA 1050 

TGATAAGGGA GGACTGGGTG ACTTGATTCA GCAACTGCAA CCGACAGAGA 1100 

TGAATAAGGT CAATATATCC GACTCCGATA TGAGTATCTT GCACCAAGGT 1150 

TTTTATCAGG TTGCTCATGG GACTAGTGGA TTGACAACTG GACGTGCTTT 1200 

TTCAAATGGT GCCTTGGTAT CCATTAGTGG GAAAACAGGT ACTGCCGAAA 1250 

GTTATGTTGC AGGTGGTCAA GAAGCCAACA ATACCAACGC GGTGGCCTAT 13 00 

GCCCCATCAG ATAATCCTCA AATCGCTGTT GCCGTTGTCT TCCCTCATAA 1350 

CACCAATCTA ACAAATGGTG TTGGACCTTC TATTGCACGC GATATTATCA 14 00 

ACCTCTATAA CCAACACCAT CCAATGAATT AGAAAGGAAC TTATGC 1446 



2) INFORMATION FOR SEQ ID NO: 1034 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

579 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-01 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1034 



GGAAACTGCA GAGGTCAAGG GGATTGATTT TACAACCAGT CCCAATCGTA 50 

GTTATCCAAA CGGACAATTT GCTTCTAGTT TTATCGGACT AGCTCAGCTC 100 

10 CATGAAAATG AAGATGGCAG CAAGAGCTTA CTGGGAACCT CTGGAATGGA 150 

GAGTTCCTTG AACAGTATTC TTGCAGGGAC AGACGGTATT ATTACCTATG 200 

AAAAAGACCG TGTAGGAAAT ATCGTACCAG GTACAGAACT GGTATCGCAA 250 

CAAACTGTGG ATGGCAAGGA TGTTTATACA ACATTGTCTA GTCCGCTACA 300 

ATCTTTCATG GAAACTCAGA TGGATGCCTT TCTAGAAAAA GTAAAAGGTA 350 

15 AGTATATGAC CGCGACCTTG GTCAGTGCAA AGACCGGTGA AATCCTCGCT 4 00 

ACCACCCAAC GACCTACCTT TAATGCAGAT ACTAAAGAAG GAATCACTGA 450 

GGACTTTGTT TGGCGTGATA TTCTTTATCA AAGTAACTAT GAACCAGGAT 500 

CAGCCATGAA GGTTATGACG TTAGCTTCTT CTATTGATAA TAATACCTTC 550 

CCAAGTGGAG AATACTTCAA TAGCAGTGAA TTCAAAATAG CGGATGCGAC 600 

20 GACTCGAGAT TGGGATGTTA ATGATGGTTT GACTACTGGT GGGATGATGA 650 

CTTTCTTACA AGGTTTCGCT CACTCCAGTA ATGTTGGAAT GAGTCTACTT 700 

GAACAAAAAA TGGGAGATGC TACTTGGTTG GATTATCTAA AACGCTTTAA 750 

ATTTGGGGTT CCAACTCGCT TTGGCTTGAC AGATGAATAC GCTGGTCAAC 800 

TTCCAGCTGA TAATATTGTT AGTATTGCTC AAAGCTCATT TGGG C AAGGA 850 

25 . ATTTCAGTGA CACAAACACA AATGCTTCGT GCCTTTACAG CTATTGCTAA 900 

TGATGGAGTT ATGCTGGAGC CAAAATTTAT AAGTGCTATT TATGATACTA 950 

ACAATCAGTC TGTACGTAAG TCACAAAAAG AAATAGTAGG AAAT CCTGTT 1000 

TCCAAAGAGG CAGCAAGCAC AACTCGAAAT CACATGATCT TAGTTGGGAC 1050 

GGACCCTCTA TATGGAACTA TGTATAATCA CTACACAGGA AAGC C AATTA 1100 

30 TAACAGTTCC TGGACAAAAT GTAGCAGTTA AATCCGGTAC GGCTCAAATC 1150 

GCTGATGAGA AAAATGGAGG ATACTTGGTT GGTTCTACCA ATTATATTTT 1200 

CTCAGTTGTG ACTATGAATC CTGCTGAAAA TCCTGATTTT ATCTTGTATG 1250 

TAACGGTTCA ACAGCCTGAG CATTATTCAG GTATCCAGTT GGGAGAATTT 1300 

GCCACCCCAA TCTTGGAGCG GGCTTCAGCT ATGAAAGAAT CTCTCAATCT 1350 

35 TCAATCTCCA GCCAAAAATT TAGATAAAGT TACGACAGAA TCTTCTTATG 1400 

CAATGCCTAG CATCAAGGAT ATTTCACCTG GTGAGTTGGC GGAAGCCTTA 1450 

CGCCGAAATA TTGTGCAACC AATCGTTGTA GGTACTGGAA CAAAGATTAA 1500 

AGAGACTTCT GTAGAAGAAG GGACCAATCT TGCACCAAAC CAACAAGTTC 1550 

TCCTTTTATC GGATAAGGTA GAAGAAATTC CAGACATGTA TGGCTGGAAA 1600 

4 0 AAAGAGACTG CCGAGACCTT TGCTAAATGG TTGGATATTG AACTGGAATT 1650 

TGAAGGTTCA GGTTCCGTTG 1670 



45 2) INFORMATION FOR SEQ ID NO: 1035 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1683 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-02 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1035 

580 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/C A00/01 150 

AAAGAATTGG AAACTGCAGA GGTCAAGGGG ATTGATTTTA CAACCAGTCC 50 

CAATCGTAGT TACCCAAATG GACAATTTGC TTCTAGTTTT ATCGGTCTAG 10 0 

CTCAGCTCCA TGAAAATGAA GATGGAAGCA AGAGTTTGCT GGGAACCTCT 150 

GGAATGGAGA GTTCCTTGAA CAGTATTCTT GCAGGGACAG ACGGCATTAT 2 00 

5 TACCTATGAA AAGGATCGTC TGGGCAATAT TGTACCCGGA ACAGAACAAG 250 

TTTCCCAACA AACGGTGGAT GGCAAGGATG TTTACACAAC CATTTCCAGC 3 00 

CCCCTCCAGT CCTTCATGGA AACTCAGATG GATGCCTTTC TAGAAAAAGT 350 

AAAAGGTAAG TATATGACCG CGACCTTGGT CAGTGCAAAG ACCGGTGAAA 400 

TCCTCGCTAC CACCCAACGA CCTACCTTTA ATGCAGATAC TAAAGAAGGA 450 

10 ATCACTGAGG ACTTTGTTTG GCGTGATATT CTTTATCAAA GTAACTATGA 500 

ACCAGGATCA GCCATGAAGG TTATGACGTT AGCTTCTTCT ATTGATAATA 550 

ATACCTTCCC AAGTGGAGAA TACTTCAATA GCAGTGAATT CAAAATAGCG 600 

GATGCGACGA CTCGAGATTG GGATGTTAAT GATGGTTTGA CTACTGGTGG 650 

GATGATGACT TTCTTACAAG GTTTCGCTCA CTCCAGTAAT GTTGGAATGA 700 

15 GTCTACTTGA ACAAAAAATG GGAGATGCTA CTTGGTTGGA TTATCTAAAA 750 

CGCTTTAAAT TTGGGGTTCC AACTCGCTTT GGCTTGACAG ATGAATACGC 800 

TGGTCAACTT CCAGCTGATA ATATTGTTAG TATTGCTCAA AGCTCATTTG 850 

GGCAAGGAAT TTCAGTGACA CAAACACAAA TGCTTCGTGC CTTTACAGCT 900 

ATTGCTAATG ATGGAGTTAT GCTGGAGCCA AAATTTATAA GTGCTATTTA 950 

20 TGATACTAAC AATCAGTCTG TACGTAAGTC ACAAAAAGAA ATAGTAGGAA 1000 

ATCCTGTTTC CAAAGAGGCA GCAAGCACAA CTCGAAATCA CATGATCTTA 1050 

GTTGGGACGG ACCCTCTATA TGGAACTATG TATAATCACT ACACAGGAAA 1100 

GCCAATTATA ACAGTTCCTG GACAAAATGT AGCAGTTAAA TCCGGTACGG 1150 

CTCAAATCGC TGATGAGAAA AATGGAGGAT ACTTGGTTGG TTCTACCAAT 12 00 

25 TATATTTTCT CAGTTGTGAC TATGAATCCT GCTGAAAATC CTGATTTTAT 1250 

CTTGTATGTA ACGGTTCAAC AGCCTGAGCA TTATTCAGGT ATCCAGTTGG 13 00 

GAGAATTTGC CACCCCAATC TTGGAGCGGG CTTCAGCTAT GAAAGAATCT 1350 

CTCAATCTTC AATCTCCAGC CAAAAATTTA GATAAAGTTA CGACAGAATC 1400 

TTCTTATGCA ATGCCTAGCA TCAAGGATAT TTCACCTGGT GAGTTGGCGG 1450 

3 0 AAGCCTTACG CCGAAATATT GTGCAACCAA TCGTTGTAGG TACTGGAACA 1500 

AAGATTAAAG AGACTTCTGT AGAAGAAGGG ACCAATCTTG CACCAAACCA 1550 

ACAAGTTCTC CTTTTATCGG ATAAGGTAGA AGAAATTCCA GACATGTATG 1600 

GCTGGAAAAA AGAGACTGCC GAGACCTTTG CTAAATGGTT GGATATTGAA 1650 

CTGGAATTTG AAGGTTCAGG TTCCGTTGTT CAG 1683 

35 



2) INFORMATION FOR SEQ ID NO: 1036 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1682 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
50 (B) STRAIN: StrR-03 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103 6 

TCAAAAAAGA ATTGGAAACT GCAGAGGTCA AGGGGATTGA TTTTACAACC 50 

55 AGTCCCAATC GTAGTTACCC AAATGGACAA TTTGCTTCTA GTTTTATCGG 100 

TCTAGCTCAG CTCCATGAAA ATGAAGATGG AAGCAAGAGT TTGCTGGGAA 150 

CCTCTGGAAT GGAGAGTTCC TTGAACAGTA TTCTTGCAGG GACAGACGGC 200 

ATTATTACCT ATGAAAAGGA TCGTCTGGGC AATATTGTAC CCGGAACAGA 250 

ACAAGTTTCC CAACAAACGG TGGATGGCAA GGATGTTTAC ACAACCATTT 300 

60 CCAGCCCCCT CCAGTCCTTC ATGGAAACTC AGATGGATGC CTTTCTAGAA 350 
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SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

AAAGTAAAAG GTAAGTATAT GACCGCGACC TTGGTCAGTG CAAAGACCGG 400 

TGAAATCCTC GCTACCACCC AACGACCTAC CTTTAATGCA GATACTAAAG 450 

AAGGAATCAC TGAGGACTTT GTTTGGCGTG ATATTCTTTA TCAAAGTAAC 500 

TATGAACCAG GATCAGCCAT GAAGGTTATG ACGTTAGCTT CTTCTATTGA 550 

TAATAATACC TTCCCAAGTG GAGAATACTT CAATAGCAGT GAATTCAAAA 600 

TAGCGGATGC GACGACTCGA GATTGGGATG TTAATGATGG TTTGACTACT 650 

GGTGGGATGA TGACTTTCTT ACAAGGTTTC GCTCACTCCA GTAATGTTGG 700 

AATGAGTCTA CTTGAACAAA AAATGGGAGA TGCTACTTGG TTGGATTATC 750 

TAAAACGCTT TAAATTTGGG GTTCCAACTC GCTTTGGCTT GACAGATGAA 800 

TACGCTGGTC AACTTCCAGC TGATAATATT GTTAGTATTG CTCAAAGCTC 850 

ATTTGGGCAA GGAATTTCAG TGACACAAAC ACAAATGCTT CGTGCCTTTA 900 

CAGCTATTGC TAATGATGGA GTTATGCTGG AGCCAAAATT TATAAGTGCT 950 

ATTTATGATA CTAACAATCA GTCTGTACGT AAGTCACAAA AAGAAATAGT 1000 

AGGAAATCCT GTTTCCAAAG AGGCAGCAAG CACAACTCGA AATCACATGA 1050 

TCTTAGTTGG GACGGACCCT CTATATGGAA CTATGTATAA TCACTACACA 1100 

GGAAAGCCAA TTATAACAGT TCCTGGACAA AATGTAGCAG TTAAATCCGG 1150 

TACGGCTCAA ATCGCTGATG AGAAAAATGG AGGATACTTG GTTGGTTCTA 1200 

CCAATTATAT TTTCTCAGTT GTGACTATGA ATCCTGCTGA AAATCCTGAT 12 50 

TTTATCTTGT ATGTAACGGT TCAACAGCCT GAGCATTATT CAGGTATCCA 13 00 

GTTGGGAGAA TTTGCCACCC CAATCTTGGA GCGGGCTTCA GCTATGAAAG 1350 

AATCTCTCAA TCTTCAATCT CCAGCCAAAA ATTTAGATAA AGTTACGACA 1400 

GAATCTTCTT ATGCAATGCC TAGCATCAAG GATATTTCAC CTGGTGAGTT 14 50 

GGCGGAAGCC TTACGCCGAA ATATTGTGCA ACCAATCGTT GTAGGTACTG 1500 

GAACAAAGAT TAAAGAGACT TCTGTAGAAG AAGGGACCAA TCTTGCACCA 1550 

AACCAACAAG TTCTCCTTTT ATCGGATAAG GTAGAAGAAA TTCCAGACAT 1600 

GTATGGCTGG AAAAAAGAGA CTGCCGAGAC CTTTGCTAAA TGGTTGGATA 1650 

TTGAACTGGA ATTTGAAGGT TCAGGTTCCG TT 1682 



2) INFORMATION FOR SEQ ID NO: 1037 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1681 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-04 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1037 

CAAAAAAGAA TTGGAAACTG CAGAGGTCAA GGGGATTGAT TTTACAACCA 50 

GTCCCAATCG TAGTTATCCA AACGGACAAT TTGCTTCTAG TTTTATCGGA 100 

CTAGCTCAGC TCCATGAAAA TGAAGATGGC AGCAAGAGCT TACTGGGAAC 150 

CTCTGGAATG GAGAGTTCCT TGAACAGTAT TCTTGCAGGG ACAGACGGTA 2 00 

TTATTACCTA TGAAAAAGAC CGTGTAGGAA ATATCGTACC AGGTACAGAA 250 

CTGGTATCGC AACAAACTGT GGATGGCAAG GATGTTTATA CAACATTGTC 3 00 

TAGTCCGCTA CAATCTTTCA TGGAAACTCA GATGGATGCC TTTCTAGAAA 350 

AAGTAAAAGG TAAGTATATG ACCGCGACCT TGGTCAGTGC AAAGACCGGT 4 00 

GAAATCCTCG CTACCACCCA ACGACCTACC TTTAATGCAG ATACTAAAGA 450 

AGGAATCACT GAGGACTTTG TTTGGCGTGA TATTCTTTAT CAAAGTAACT 500 

ATGAACCAGG ATCAGCCATG AAGGTTATGA CGTTAGCTTC TTCTATTGAT 550 

AATAATACCT TCCCAAGTGG AGAAT AC TTC AATAGCAGTG AATTCAAAAT 600 

AGCGGATGCG ACGACTCGAG ATTGGGATGT TAATGATGGT TTGACTACTG 650 

GTGGGATGAT GACTTTCTTA CAAGGTTTCG CTCACTCCAG TAATGTTGGA 700 
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ATGAGTCTAC TTGAACAAAA AATGGGAGAT GCTACTTGGT TGGATTATCT 750 

AAAACGCTTT AAATTTGGGG TTCCAACTCG CTTTGGCTTG ACAGATGAAT 800 

ACGCTGGTCA ACTTCCAGCT GATAATATTG TTAGTATTGC TCAAAGCTCA 850 

TTTGGGCAAG GAATTTCAGT GACACAAACA CAAATGCTTC GTGCCTTTAC 900 

5 AGCTATTGCT AATGATGGAG TTATGCTGGA GCCAAAATTT ATAAGTGCTA 950 

TTTATGATAC TAACAATCAG TCTGTACGTA AGTCACAAAA AGAAATAGTA 100 0 

GGAAATCCTG TTTCCAAAGA GGCAGCAAGC ACAACTCGAA ATCACATGAT 1050 

CTTAGTTGGG ACGGACCCTC TATATGGAAC TATGTATAAT CACTACACAG 1100 

GAAAGCCAAT TATAACAGTT CCTGGACAAA ATGTAGCAGT TAAATCCGGT 1150 

10 ACGGCTCAAA TCGCTGATGA GAAAAATGGA GGATACTTGG TTGGTTCTAC 1200 

CAATTATATT TTCTCAGTTG TGACTATGAA TCCTGCTGAA AATCCTGATT 1250 

TTATCTTGTA TGTAACGGTT CAACAGCCTG AGCATTATTC AGGTATCCAG 13 00 

TTGGGAGAAT TTGCCACCCC AATCTTGGAG CGGGCTTCAG CTATGAAAGA 1350 

ATCTCTCAAT CTTCAATCTC CAGCCAAAAA TTTAGATAAA GTTACGACAG 1400 

15 AATCTTCTTA TGCAATGCCT AGCATCAAGG ATATTTCACC TGGTGAGTTG 1450 

GCGGAAGCCT TACGCCGAAA TATTGTGCAA CCAATCGTTG TAGGTACTGG 1500 

AACAAAGATT AAAGAGACTT CTGTAGAAGA AGGGACCAAT CTTGCACCAA 1550 

ACCAACAAGT TCTCCTTTTA TCGGATAAGG TAGAAGAAAT TCCAGACATG 1600 

TATGGCTGGA AAAAAGAGAC TGCCGAGACC TTTGCTAAAT GGTTGGATAT 1650 

2 0 TGAACTGGAA TTTGAAGGTT CAGGTTCCGT T 1681 



2) INFORMATION FOR SEQ ID NO: 1038 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1685 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNES S : Doubl e 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1038 

40 AAAAAAGAAT TGGAAACTGC AGAGGTCAAG GGGATTGATT TTACAACCAG 50 

TCCCAATCGT AGTTATCCAA ACGGACAATT TGCTTCTAGT TTTATCGGAC 100 

TAGCTCAGCT CCATGAAAAT GAAGATGGCA GCAAGAGCTT ACTGGGAACC 150 

TCTGGAATGG AGAGTTCCTT GAACAGTATT CTTGCAGGGA CAGACGGTAT 2 00 

TATTACCTAT GAAAAAGACC GTGTAGGAAA TATCGTACCA GGTACAGAAC 2 50 

45 TGGTATCGCA ACAAACTGTG GATGGCAAGG ATGTTTATAC AACATTGTCT 3 00 

AGTCCGCTAC AATCTTTCAT GGAAACTCAG ATGGATGCCT TTCTAGAAAA 350 

AGTAAAAGGT AAGTATATGA CCGCGACCTT GGTCAGTGCA AAGACCGGTG 400 

AAATCCTCGC TACCACCCAA CGACCTACCT TTAATGCAGA TACTAAAGAA 4 50 

GGAATCACTG AGGACTTTGT TTGGCGTGAT ATTCTTTATC AAAGTAACTA 500 

50 TGAACCAGGA TCAGCCATGA AGGTTATGAC GTTAGCTTCT TCTATTGATA 550 

ATAATACCTT CCCAAGTGGA GAATACTTCA ATAGCAGTGA ATTCAAAATA 600 

GCGGATGCGA CGACTCGAGA TTGGGATGTT AATGATGGTT TGACTACTGG 650 

TGGGATGATG ACTTTCTTAC AAGGTTTCGC TCACTCCAGT AATGTTGGAA 700 

TGAGTCTACT TGAACAAAAA ATGGGAGATG CTACTTGGTT GGATTATCTA 750 

55 AAACGCTTTA AATTTGGGGT TCCAACTCGC TTTGGCTTGA CAGATGAATA 800 

CGCTGGTCAA CTTCCAGCTG ATAATATTGT TAGTATTGCT CAAAGCTCAT 850 

TTGGGCAAGG AATTTCAGTG ACACAAACAC AAATGCTTCG TGCCTTTACA 900 

GCTATTGCTA ATGATGGAGT TATGCTGGAG CCAAAATTTA TAAGTGCTAT 950 

TTATGATACT AACAATCAGT CTGTACGTAA GTCACAAAAA GAAATAGTAG 1000 

60 GAAATCCTGT TTCCAAAGAG GCAGCAAGCA CAACTCGAAA TCACATGATC 1050 
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TTAGTTGGGA CGGACCCTCT ATATGGAACT ATGTATAATC ACTACACAGG 1100 

AAAGCCAATT ATAACAGTTC CTGGACAAAA TGTAGCAGTT AAATCCGGTA 1150 

CGGCTCAAAT CGCTGATGAG AAAAATGGAG GATACTTGGT TGGTTCTACC 1200 

AATTATATTT TCTCAGTTGT GACTATGAAT CCTGCTGAAA ATCCTGATTT 1250 

5 TATCTTGTAT GTAACGGTTC AACAGCCTGA GCATTATTCA GGTATCCAGT 1300 

TGGGAGAATT TGCCACCCCA ATCTTGGAGC GGGCTTCAGC TATGAAAGAA 1350 

TCTCTCAATC TTCAATCTCC AGCCAAAAAT TTAGATAAAG TTACGACAGA 1400 

ATCTTCTTAT GCAATGCCTA GCATCAAGGA TATTTCACCT GGTGAGTTGG 1450 

CGGAAGCCTT ACGCCGAAAT ATTGTGCAAC CAATCGTTGT AGGTACTGGA 1500 

10 ACAAAGATTA AAGAGACTTC TGTAGAAGAA GGGACCAATC TTGCACCAAA 1550 

CCAACAAGTT CTCCTTTTAT CGGATAAGGT AGAAGAAATT CCAGACATGT 1600 

ATGGCTGGAA AAAAGAGACT GCCGAGACCT TTGCTAAATG GTTGGATATT 1650 

GAACTGGAAT TTGAAGGTTC AGGTTCCGTT GTTCA 1685 



15 



30 



2) INFORMATION FOR SEQ ID NO: 1039 



(i) SEQUENCE CHARACTERISTICS: 
2 0 (A) LENGTH: 1679 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1039 



AAAAGAATTG GAAACTGCAG AGGTCAAGGG GATTGATTTT ACAACCAGTC 50 

CCAATCGTAG TTATCCAAAC GGACAATTTG CTTCTAGTTT TATCGGACTA 100 

35 GCTCAGCTCC ATGAAAATGA AGATGGCAGC AAGAGCTTAC TGGGAACCTC 150 

TGGAATGGAG AGTTCCTTGA ACAGTATTCT TGCAGGGACA GACGGTATTA 2 00 

TTACCTATGA AAAAGACCGT GTAGGAAATA TCGTACCAGG TACAGAACTG 250 

GTATCGCAAC AAACTGTGGA TGGCAAGGAT GTTTATACAA CATTGTCTAG 300 

TCCGCTACAA TCTTTCATGG AAACTCAGAT GGATGCCTTT CTAGAAAAAG 350 

40 TAAAAGGTAA GTATATGACC GCGACCTTGG TCAGTGCAAA GACCGGTGAA 400 

ATCCTCGCTA CCACCCAACG ACCTACCTTT AATGCAGATA CTAAAGAAGG 450 

AATCACTGAG GACTTTGTTT GGCGTGATAT TCTTTATCAA AGTAACTATG 500 

AACCAGGATC AGCCATGAAG GTTATGACGT TAGCTTCTTC TATTGATAAT 550 

AATACCTTCC CAAGTGGAGA ATACTTCAAT AGCAGTGAAT TCAAAATAGC 600 

45 GGATGCGACG ACTCGAGATT GGGATGTTAA TGATGGTTTG ACTACTGGTG 650 

GGATGATGAC TTTCTTACAA GGTTTCGCTC ACTCCAGTAA TGTTGGAATG 700 

AGTCTACTTG AACAAAAAAT GGGAGATGCT ACTTGGTTGG ATTATCTAAA 750 

ACGC TTTAAA TTTGGGGTTC CAACTCGCTT TGGCTTGACA GATGAATACG 800 

CTGGTCAACT TCCAGCTGAT AATATTGTTA GTATTGCTCA AAGCTCATTT 850 

50 GGGCAAGGAA TTTCAGTGAC ACAAACACAA ATGCTTCGTG CCTTTACAGC 900 

TATTGCTAAT GATGGAGTTA TGCTGGAGCC AAAATTTATA AGTGCTATTT 950 

ATGATACTAA CAATCAGTCT GTACGTAAGT CACAAAAAGA AATAGTAGGA 1000 

AATCCTGTTT CCAAAGAGGC AGCAAGCACA ACTCGAAATC ACATGATCTT 1050 

AGTTGGGACG GACCCTCTAT ATGGAACTAT GTATAATCAC TACACAGGAA 1100 

55 AGCCAATTAT AACAGTTCCT GGACAAAATG TAGCAGTTAA ATCCGGTACG 1150 

GCTCAAATCG CTGATGAGAA AAATGGAGGA TACTTGGTTG GTTCTACCAA 1200 

TTATATTTTC TCAGTTGTGA CTATGAATCC TGCTGAAAAT CCTGATTTTA 1250 

TCTTGTATGT AACGGTTCAA CAGCCTGAGC ATTATTCAGG TATCCAGTTG 1300 

GGAGAATTTG CCACCCCAAT CTTGGAGCGG GCTTCAGCTA TGAAAGAATC 1350 

60 TCTCAATCTT CAATCTCCAG CCAAAAATTT AGATAAAGTT ACGACAGAAT 1400 
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CTTCTTATGC AATGCCTAGC ATCAAGGATA 
GAAGCCTTAC GCCGAAATAT TGTGCAACCA 
AAAGATTAAA GAGACTTCTG TAGAAGAAGG 
AACAAGTTCT CCTTTTATCG GATAAGGTAG 
5 GGCTGGAAAA AAGAGACTGC CGAGACCTTT 
ACTGGAATTT GAAGGTTCAG GTTCCGTTG 



PCT/CA00/01150 

TTTCACCTGG TGAGTTGGCG 1450 
ATCGTTGTAG GTACTGGAAC 1500 
GACCAATCTT GCACCAAACC 1550 
AAGAAATTCC AGACATGTAT 1600 
GCTAAATGGT TGGATATTGA 1650 

1679 



10 2) INFORMATION FOR SEQ ID NO: 1040 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1685 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-07 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104 0 



TAAAAAGGAT CTAAAAGACG CTAGTGTTGA AGGAATTGAC TTCACAACTA 50 

GCCCTAATAG AAGCTATCCA AATGGACAAT TCGCTTCTAG TTTTATTGGT 100 

TTGGCCCAAC TCCATGAAAA TGAGGATGGT AGCAAGAGTT TGCTGGGAAC 150 

TTCTGGGATG GAGAGTTCCT TGAACAGTAT TCTTGCAGGG AAAGACGGTA 200 

30 TTATTACCTA TGAAAAAGAT CGTCTGGGTA ATATTGTCCC TGGAACAGAA 250 

CAAGTTTCCC AACAAACGGT AGATGGCAAG GATGTTTATA CAACATTGTC 300 

TAGTCCGCTA CAATCTTTCA TGGAAACTCA GATGGATGCC TTTCTAGAAA 350 

AAGTAAAAGG TAAGTATATG ACCGCGACCT TGGTCAGTGC AAAGACCGGT 400 

GAAATCCTCG CTACCACCCA ACGACCTACC TTTAATGCAG ATACTAAAGA 450 

35 AGGAATCACT GAGGACTTTG TTTGGCGTGA TATTCTTTAT CAAAGTAACT 500 

ATGAACCAGG ATCAGCCATG AAGGTTATGA CGTTAGCTTC TTCTATTGAT 550 

AATAATACCT TCCCAAGTGG AGAATACTTC AATAGCAGTG AATTCAAAAT 600 

AGCGGATGCG ACGACTCGAG ATTGGGATGT TAATGAAGGT TTGACTACTG 650 

GTGGGATGAT GACTTTCTTA CAAGGTTTCG CTCACTCCAG TAATGTTGGA 700 

40 ATGAGTCTAC TTGAACAAAA AATGGGAGAT GCTACTTGGT TGGATTATCT 750 

AAAACGCTTT AAATTTGGGG TTCCAACTCG CTTTGGCTTG ACAGATGAAT 800 

ACGCTGGTCA ACTTCCAGCT GATAATATTG TTAGTATTGC TCAAAGCTCA 850 

TTTGGGCAAG GAATTTCAGT GACACAAACA CAAATGCTTC GTGCCTTTAC 900 

AGCTATTGCT AATGATGGAG TTATGCTGGA GCCAAAATTT ATAAGTGCTA 950 

45 TTTATGATAC TAACAATCAG TCTGTACGTA AGTCACAAAA AGAAATAGTA 1000 

GGAAATCCTG TTTCCAAAGA GGCAGCAAGC ACAACTCGAA ATCACATGAT 1050 

CTTAGTTGGG ACGGACCCTC TATATGGAAC TATGTATAAT CACTACACAG 1100 

GAAAGCCAAT TATAACAGTT CCTGGACAAA ATGTAGCAGT TAAATCCGGT 1150 

ACGGCTCAAA TCGCTGATGA GAAAAATGGA GGATACTTGG TTGGTTCTAC 1200 

50 CAATTATATT TTCTCAGCTG TGACTATGAA TCCTGCTGAA AATCCTGATT 1250 

TTATCTTGTA TGTAACGGTT CAACAGCCTG AGCATTATTC AGGTATCCAG 1300 

TTGGGAGAAT TTGCCACCCC AATCTTGGAG CGGGCTTCAG CTATGAAAGA 1350 

ATCTCTCAAT CTTCAATCTC CAGCCAAAAA TTTAGATAAA GTTACGACAG 1400 

AATCTTCTTA TGCAATGCCT AGCATCAAGG ATATTTCACC TGGTGAGTTG 1450 

55 GCGGAAGCCT TACGCCGAAA TATTGTGCAA CCAATCGTTG TAGGTACTGG 1500 

AACAAAGATT AAAGAGACTT CTGTAGAAGA AGGGACCAAT CTTGCACCAA 1550 

ACCAACAAGT TCTCCTTTTA TCGGATAAGG TAGAAGAAAT TCCAGACATG 1600 

TATGGCTGGA AAAAAGAGAC TGCCGAGACC TTTGCTAAAT GGTTGGATAT 1650 

TGAACTGGAA TTTGAAGGTT CAGGTTCCGT CGTTC 1685 



60 
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PCT/CA00/01150 



10 



55 



2) INFORMATION FOR SEQ ID NO: 1041 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1696 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
15 (B) STRAIN: StrR-08 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1041 

GTCTATCAAA AAAGAATTGG AAACTGCAGA GGTCAAGGGG ATTGATTTTA 50 

20 CAACCAGTCC CAATCGTAGT TATCCAAACG GACAATTTGC TTCTAGTTTT 100 

ATCGGACTAG CTCAGCTCCA TGAAAATGAA GATGGCAGCA AGAGCTTACT 150 

GGGAACCTCT GGAATGGAGA GTTCCTTGAA CAGTATTCTT . GCAGGGACAG 200 

ACGGTATTAT TACCTATGAA AAAGACCGTG TAGGAAATAT CGTACCAGGT 250 

ACAGAACTGG TATCGCAACA AACTGTGGAT GGCAAGGATG TTTATACAAC 300 

25 ATTGTCTAGT CCGCTACAAT CTTTCATGGA AACTCAGATG GATGCCTTTC 350 

TAGAAAAAGT AAAAGGTAAG TATATGACCG CGACCTTGGT CAGTGCAAAG 400 

ACCGGTGAAA TCCTCGCTAC CACCCAACGA CCTACCTTTA ATGCAGATAC 450 

TAAAGAAGGA ATCACTGAGG ACTTTGTTTG GCGTGATATT CTTTATCAAA 500 

GTAACTATGA AC CAGGATCA GCCATGAAGG TTATGACGTT AGCTTCTTCT 550 

3 0 ATTGATAATA ATACCTTCCC AAGTGGAGAA TACTTCAATA GCAGTGAATT 600 

CAAAATAGCG GATGCGACGA CTCGAGATTG GGATGTTAAT GATGGTTTGA 650 

CTACTGGTGG GATGATGACT TTCTTACAAG GTTTCGCTCA CTCCAGTAAT 700 

GTTGGAATGA GTCTACTTGA ACAAAAAATG GGAGATGCTA CTTGGTTGGA 750 

TTATCTAAAA CGCTTTAAAT TTGGGGTTCC AACTCGCTTT GGCTTGACAG 8 00 

35 ATGAATACGC TGGTCAACTT CCAGCTGATA ATATTGTTAG TATTGCTCAA 850 

AGCTCATTTG GGCAAGGAAT TTCAGTGACA CAAACACAAA TGCTTCGTGC 900 

CTTTACAGCT ATTGCTAATG ATGGAGTTAT GCTGGAGCCA AAATTTATAA 950 

GTGCTATTTA TGATACTAAC AATCAGTCTG TACGTAAGTC ACAAAAAGAA 1000 

ATAGTAGGAA ATCCTGTTTC .CAAAGAGGCA GCAAGCACAA CTCGAAATCA 1050 

40 CATGATCTTA GTTGGGACGG ACCCTCTATA TGGAACTATG TATAATCACT 1100 

ACACAGGAAA GCCAATTATA ACAGTTCCTG GACAAAATGT AGCAGTTAAA 1150 

TCCGGTACGG CTCAAATCGC TGATGAGAAA AATGGAGGAT ACTTGGTTGG 12 00 

TTCTACCAAT TATATTTTCT CAGTTGTGAC TATGAATCCT GCTGAAAATC 1250 

CTGATTTTAT CTTGTATGTA ACGGTTCAAC AGCCTGAGCA TTATTCAGGT 1300 

45 ATCCAGTTGG GAGAATTTGC CACCCCAATC TTGGAGCGGG CTTCAGCTAT 1350 

GAAAGAATCT CTCAATCTTC AATCTCCAGC CAAAAATTTA GATAAAGTTA 1400 

CGACAGAATC TTCTTATGCA ATGCCTAGCA TCAAGGATAT TTCACCTGGT 1450 

GAGTTGGCGG AAGCCTTACG CCGAAATATT GTGCAACCAA TCGTTGTAGG 1500 

TACTGGAACA AAGATTAAAG AGACTTCTGT AGAAGAAGGG ACCAATCTTG 1550 

50 CACCAAACCA ACAAGTTCTC CTTTTATCGG ATAAGGTAGA AGAAATTCCA 1600 

GACATGTATG GCTGGAAAAA AGAGACTGCC GAGACCTTTG CTAAATGGTT 1650 

GGATATTGAA CTGGAATTTG AAGGTTCAGG TTCCGTTGTT CAGAAG 1696 



2) INFORMATION FOR SEQ ID NO: 1042 



. (i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1689 bases 
60 (B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-09 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1042 

AAAAAAGAGT TGGAAACTGC AGAGGTCAAG GGGATTGATT TTACAACCAG 50 

TCCCAATCGT AGTTACCCAA ACGGACAATT TGCTTCTAGT TTTATCGGAC 100 

TAGCTCAGCT CCATGAAAAT GAAGATGGCA GCAAGAGCTT GCTGGGAACC 150 

15 TCTGGAATGG AGAGTTCCTT GAACAGTATT CTTGCAGGGA CAGACGGCAT 200 

TATTACCTAT GAAAAGGATC GTCTGGGAAA TATTGTCCCC GGAACGGAAC 250 

AAGTTTCCCA ACAAACTGTA GATGGCAAGG ATGTTTATAC GACTATTTCC 300 

AGCACCCTTC AGTCCTTCAT GGAAACTCAG ATGGATGCCT TTCTAGAAAA 350 

AGTAAAAGGT AAGTATATGA CCGCGACCTT GGTCAGTGCA AAGACCGGTG 400 

2 0 AAATCCTCGC TACCACCCAA CGACCTACCT TTAATGCAGA TACTAAAGAA 450 

GGAATCACTG AGGACTTTGT TTGGCGTGAT ATTCTTTATC AAAGTAACTA 500 

TGAACCAGGA TCAGCCATGA AGGTTATGAC GTTAGCTTCT TCTATTGATA 550 

ATAATACCTT CCCAAGTGGA GAATACTTCA ATAGCAGTGA ATTCAAAATA 600 

GCGGATGCGA CGACTCGAGA TTGGGATGTT AATGATGGTT TGACTACTGG 650 

25 TGGGATGATG ACTTTCTTAC AAGGTTTCGC TCACTCCAGT AATGTTGGAA 700 

TGAGTCTACT TGAACAAAAA ATGGGAGATG CTACTTGGTT GGATTATCTA 750 

AAACGCTTTA AATTTGGGGT TCCAACTCGC TTTGGCTTGA CAGATGAATA 800 

CGCTGGTCAA CTTCCAGCTG ATAATATTGT TAGTATTGCT CAAAGCTCAT 850 

TTGGGCAAGG AATTTCAGTG ACACAAACAC AAATGCTTCG TGCCTTTACA 900 

3 0 GCTATTGCTA ATGATGGAGT TATGCTGGAG CCAAAATTTA TAAGTGCTAT 950 

TTATGATACT AACAATCAGT CTGTACGTAA GTCACAAAAA GAAATAGTAG 1000 

GAAATCCTGT TTCCAAAGAG GCAGCAAGCA CAACTCGAAA TCACATGATC 1050 

TTAGTTGGGA CGGACCCTCT AT ATGGAAC T ATGTATAATC ACTACACAGG 1100 

AAAGCCAATT ATAACAGTTC CTGGACAAAA TGTAGCAGTT AAATCCGGTA 1150 

3 5 CGGCTCAAAT CGCTGATGAG AAAAATGGAG GATACTTGGT TGGTTCTACC 1200 

AATTATATTT TCTCAGTTGT GACTATGAAT CCTGCTGAAA ATCCTGATTT 1250 

TATCTTGTAT GTAACGGTTC AACAGCCTGA GCATTATTCA GGTATCCAGT 13 00 

TGGGAGAATT TGCCACCCCA ATCTTGGAGC GGGCTTCAGC TATGAAAGAC 1350 

TCTCTCAATC TTCAAACAAC AGCTAAAGCT TTGGAGCAAG TAAGTCAACA 1400 

40 AAGTCCTTAT CCTATGCCTA GTGTCAAGGA TATTTCACCT GGTGATTTAG 1450 

CAGAAGAATT GCGTCGCAAT CTTGTACAAC CCATCGTTGT GGGAACAGGA 1500 

ACGAAGATTA AAAACAGTTC TGCTGAAGAA GGGAAGAATC TTGCCCCGAA 1550 

TCAGCAAGTC CTTATCTTAT CTGATAAAGT AGAGGAAGTT CCAGATATGT 1600 

ATGGTTGGAC AAAGGAGACT GCTGAGACCC TTGCTAAGTG GCTCAATATA 1650 

45 GAACTTGAAT TTCAAGGTTC GGGTTCTACT GTGCAGAAG 1689 



2) INFORMATION FOR SEQ ID NO: 1043 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1690 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Streptococcus pneumoniae 
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(B) STRAIN: StrR-10 



PCT/CA00/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1043 

CAAAAAAGAG TTGGAAACTG CAGAGGTCAA GGGGATTGAT TTTACAACCA 50 

GTCCTAATCG TAGTTACCCA AACGGACAAT TTGCTTCTAG TTTTATCGGT 100 

CTAGCTCAAC TCCATGAAAA TGAAGATGGA AGCAAGAGTT TGCTGGGAAC 150 

TTCTGGAATG GAGAGTTCCT TGAACAGTAT TCTTGCAGGG AAAGACGGTA 200 

TTATTACTTA TGAAAAGGAT CGTCTGGGTA ATATTGTCCC TGGAACAGAA 250 

CAAGTTTCCC AACAAACGGT AGATGGCAAG GATGTTTATA CAACATTGTC 300 

TAGTCCGCTC CAATCTTTCA TGGAAACTCA GATGGATGCC TTTCTAGAAA 350 

AAGTAAAAGG TAAGTATATG ACCGCGACCT TGGTCAGTGC AAAGACCGGT 400 

GAAATCCTCG CTACCACCCA ACGACCTACC TTTAATGCAG ATACTAAAGA 450 

AGGAATCACT GAGGACTTTG TTTGGCGTGA TATTCTTTAT CAAAGTAACT 500 

ATGAACCAGG ATCAGCCATG AAGGTTATGA CGTTAGCTTC TTCTATTGAT 550 

AATAATACCT TCCCAAGTGG AGAATACTTC AATAGCAGTG AATTCAAAAT 600 

AGCGGATGCG ACGACTCGAG ATTGGGATGT TAATGATGGT TTGACTACTG 650 

GTGGGATGAT GACTTTCTTA CAAGGTTTCG CTCACTCCAG TAATGTTGGA 700 

ATGAGTCTAC TTGAACAAAA AATGGGAGAT GCTACTTGGT TGGATTATCT 750 

AAAACGCTTT AAATTTGGGG TTCCAACTCG CTTTGGCTTG ACAGATGAAT 800 

ACGCTGGTCA ACTTCCAGCT GATAATATTG TTAGTATTGC TCAAAGCTCA 850 

TTTGGGCAAG GAATTTCAGT GACACAAACA CAAATGCTTC GTGCCTTTAC 900 

AGCTATTGCT AATGATGGAG TTATGCTGGA GCCAAAATTT ATAAGTGCTA 950 

TTTATGATAC TAACAATCAG TCTGTACGTA AGTCACAAAA AGAAATAGTA 1000 

GGAAATCCTG TTTCCAAAGA GGCAGCAAGC ACAACTCGAA ATCACATGAT 1050 

CTTAGTTGGG ACGGACCCTC TATATGGAAC TATGTATAAT CACTACACAG 1100 

GAAAGCCAAT TATAACAGTT CCTGGACAAA ATGTAGCAGT TAAATCCGGT 1150 

ACGGCTCAAA TCGCTGATGA GAAAAATGGA GGATACTTGG TTGGTTCTAC 1200 

CAATTATATT TTCTCAGTTG TGACTATGAA TCCTGCTGAA AATCCTGATT 1250 

TTATCTTGTA TGTAACGGTT CAACAGCCTG AGCATTATTC AGGTATCCAG 13 00 

TTGGGAGAAT TTGCCACCCC AATCTTGGAG CGGGCTTCAG CTATGAAAGA 13 50 

ATCTCTCAAT CTTCAATCTC CAGCCAAAAA TTTAGATAAA GTTACGACAG 1400 

AATCTTCTTA TGCAATGCCT AGCATCAAGG ATATTTCACC TGGTGAGTTG 1450 

GCGGAAGCCT TACGCCGAAA TATTGTGCAA CCAATCGTTG TAGGTACTGG 1500 

AACAAAGATT AAAGAGACTT CTGTAGAAGA AGGGACCAAT CTTGCACCAA 1550 

ACCAACAAGT TCTCCTTTTA TCGGATAAGG TAGAAGAAAT TCCAGACATG 1600 

TATGGCTGGA AAAAAGAGAC TGCCGAGACC TTTGCTAAAT GGTTGGATAT 1650 

TGAACTGGAA TTTGAAGGTT CAGGTTCCGT TGTTCAGAAG 1690 



2) INFORMATION FOR SEQ ID NO: 1044 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1668 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1044 

AGTTGGAAAC TGCAGAGGTC AAGGGGATTG ATTTTACAAC CAGTCCTAAT 50 
CGTAGTTACC CAAACGGACA ATTTGCTTCT AGTTTTATCG GACTAGCTCA 100 
GCTCCATGAA AATGAAGATG GCAGCAAGAG CTTGCTGGGA ACCTCTGGGA 150 
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TGGAGAGTTC TTTAAATAGA ATTCTTGCAG GGACAGACGG CATTATTACC 200 

TATGAAAAGG ATCGTCTGGG AAATATTGTC CCCGGAACGG AACAAGTTTC 250 

CCAACAAACG GTAGATGGCA AGGATGTTTA TACGACTATT TCCAGCACCC 300 

TTCAGTCCTT CATGGAGACC CAGATGAATG CCTTTCAAGA AAAAGTAAAA 3 50 

5 GGCAAGTATA TGACGGCTAC CTTGGTCAGT GCTAAAACAG GGGAAATTCT 400 

TGCAACAACG CAACGGCCGA CCTTCGATGC TGATACTAAG GAAGGACTTA 450 

CCAAGGACTT TGTTTGGCGT GATATCCTCT ATCAAAGTAA CTATGAGCCA 500 

GGGTCAACCA TGAAGGTCAT GACGCTTGCT GCTGCTATTG . ATAATAACAC 550 

TTTCCCAGGA GGAGAAGTTT TCAATAGTAG TGAATTAAAA ATAGCGGATG 600 

10 CGACAATTCG AGATTGGGAT GTTAATGATG GTTTGACGAC TGGTGGGATG 650 

ATGACTTTCT TACAAGGTTT CGCTCACTCC AGTAATGTTG GAATGAGTCT 700 

ACTTGAACAA AAAATGGGAG ATGCTACTTG GTTGGATTAT CTAAACCGCT 750 

TTAAGTTTGG GGTGCCGACG CGTTTTGGTC TGACTGATGA GTATTCAGGT 800 

CAATTGCCTG CAGATAATAT TGTTAATATT GCCATGAGTG CATTTGGTCA 850 

15 GGGGATTTCA GTGACCCAGA CGCAAATGAT TCGTGCCTTT ACAGCTATTG 900 

CTAATGATGG TGTTATGCTG GAGCCTAAAT TTATTAGTGC CATTTATGAT 950 

CCAAATGATC AAACTGCTCG GAAATCTCAA AAAGAAATTG TGGGAAATCC 1000 

TGTTTCTAAA GATGCAGCTA GTCTAACTCG GACTAACATG ATTTTGGTAG 1050 

GGACGGATCC GGTTTATGGA ACCATGTATA ACCACAGCAC AGGTAAGCCA 1100 

20 ACTGTAACTG TTCCTGGGCA AAATGTAGCC CTCAAGTCTG GTACGGCTCA 1150 

GATTGCTGAC GAGAAAAATG GTGGTTATCT AGTCGGGTTA ACCAACTATA 1200 

TTTTCTCGGC TGTATCGATG AATCCGGCTG AAAATCCTGA TTTTATCTTG 1250 

TATGTAACGG TACAGCAACC TGAACATTAT TCAGGTATCC AGTTGGGAGA 1300 

ATTTGCCACC CCAATCTTGG AGCGGGCTTC AGCTATGAAA GAATCTCTCA 1350 

25 ATCTTCAATC TCCAGCCAAG AATTTAGATA AAGTTACGAC AGAATCTTCT 1400 

TATGCAATGC CTAGCATCAA GGATATTTCA CCTGGTGAGT TGGCGGAAGC 1450 

CTTACGCCGA AATATTGTGC AACCAATCGT TGTAGGTACT GGAACAAAGA 1500 

TTAAAGAGAC TTCTGTAGAA GAAGGGACCA ATCTTGCACC AAACCAACAA 1550 

GTTCTCCTTT TATCGGATAA GGTAGAAGAA ATTCCAGACA TGTATGGCTG 1600 

30 GAAAAAAGAG ACTGCTGAAA CCTTTGCTAA ATGGTTGGAT ATTGAGCTGG 1650 

AATTTGAAGG GTCAGGTT 1668 



35 2) INFORMATION FOR SEQ ID NO: 1045 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1680 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: .Streptococcus pneumoniae 

(B) STRAIN: StrR-12 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1045 



TTGGAAACTG CAGAGGTCAA GGGGATTGAT TTTACAACCA GTCCCAATCG 50 

TAGTTACCCA AATGGACAAT TTGCTTCTAG TTTTATCGGT CTAGCTCAGC 100 

TCCATGAAAA TGAAGATGGA AGCAAGAGTT TGCTGGGAAC CTCTGGAATG 150 

GAGAGTTCCT TGAACAGTAT TCTTGCAGGG ACAGACGGCA TTATTACCTA 200 

55 TGAAAAGGAT CGTCTGGGCA ATATTGTACC CGGAACAGAA CAAGTTTCCC 250 

AACAAACGGT GGATGGCAAG GATGTTTACA CAACCATTTC CAGCCCCCTC 300 

CAGTCCTTCA TGGAAACTCA GATGGATGCC TTTCTAGAAA AAGTAAAAGG 350 

TAAGTATATG ACCGCGACCT TGGTCAGTGC AAAGACCGGT GAAATCCTCG 400 

CTACCACCCA ACGACCTACC TTTAATGCAG ATACTAAAGA AGGAATCACT 450 

60 GAGGACTTTG TTTGGCGTGA TATTCTTTAT CAAAGTAACT ATGAACCAGG 500 
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ATCAGCCATG AAGGTTATGA CGTTAGCTTC TTCTATTGAT AATAATACCT 550 

TCCCAAGTGG AGAATACTTC AATAGCAGTG AATTCAAAAT AGCGGATGCG 600 

ACGACTCGAG ATTGGGATGT TAATGATGGT TTGACTACTG GTGGGATGAT 650 

GACTTTCTTA CAAGGTTTCG CTCACTCCAG TAATGTTGGA ATGAGTCTAC 700 

5 TTGAACAAAA AATGGGAGAT GCTACTTGGT TGGATTATCT AAAACGCTTT 750 

AAATTTGGGG TTCCAACTCG CTTTGGCTTG ACAGATGAAT ACGCTGGTCA 800 

ACTTCCAGCT GATAATATTG TTAGTATTGC TCAAAGCTCA TTTGGGCAAG 850 

GAATTTCAGT GACACAAACA CAAATGCTTC GTGCCTTTAC AGCTATTGCT 900 

AATGATGGAG TTATGCTGGA GCCAAAATTT ATAAGTGCTA TTTATGATAC 950 

10 TAACAATCAG TCTGTACGTA AGTCACAAAA AGAAATAGTA GGAAATCCTG 1000 

TTTCCAAAGA GGCAGCAAGC ACAACTCGAA ATCACATGAT CTTAGTTGGG 1050 

ACGGACCCTC TATATGGAAC TATGTATAAT CACTACACAG GAAAGCCAAT 1100 

TATAACAGTT CCTGGACAAA ATGTAGCAGT TAAATCCGGT GCGGCTCAAA 1150 

TCGCTGATGA GAAAAATGGA GGATACTTGG TTGGTTCTAC CAATTATATT 12 00 

15 TTCTCAGTTG TGACTATGAA TCCTGCTGAA AATCCTGATT TTATCTTGTA 1250 

TGTAACGGTT CAACAGCCTG AGCATTATTC AGGTATCCAG TTGGGAGAAT 13 00 

TTGCCACCCC AATCTTGGAG CGGGCTTCAG CTATGAAAGA ATCTCTCAAT 13 50 

CTTCAATCTC CAGCCAAAAA TTTAGATAAA GTTACGACAG AATCTTCTTA 14 00 

TGCAATGCCT AGCATCAAGG ATATTTCACC TGGTGAGTTG GCGGAAGCCT 14 50 

20 TACGCCGAAA TATTGTGCAA CCAATCGTTG TAGGTACTGG AACAAAGATT 1500 

AAAGAGACTT CTGTAGAAGA AGGGACCAAT CTTGCACCAA ACCAACAAGT 1550 

TCTCCTTTTA TCGGATAAGG TAGAAGAAAT TCCAGACATG TATGGCTGGA 1600 

AAAAAGAGAC TGCCGAGACC TTTGCTAAAT GGTTGGATAT TGAACTGGAA 16 50 

TTTGAAGGTT CAGGTTCCGT TGTTCAGAAG 16 80 

25 



2) INFORMATION FOR SEQ ID NO: 1046 

30 (i) SEQUENCE CHARACTERISTICS': 

(A) LENGTH: 1689 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
40 (B) STRAIN: StrR-13 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1046 

AAAAAAGAGT TGGAAACTGC AGAGGTCAAG GGGATTGATT TTACAACCAG 50 

45 TCCTAATCGT AGTTACCCAA ACGGACAATT TGCTTCTAGT TTTATCGGTC 100 

TAGCTCAACT CCATGAAAAT GAAGATGGAA GCAAGAGTTT GCTGGGAACT 150 

TCTGGAATGG AGAGTTCCTT GAACAGTATT CTTGCAGGGA AAGACGGTAT 2 00 

TATTACTTAT GAAAAGGATC GTCTGGGTAA TATTGTCCCT GGAACAGAAC 250 

AAGTTTCCCA ACAAACGGTA GATGGCAAGG ATGTTTATAC AACATTGTCT 3 00 

50 AGTCCGCTCC AATCTTTCAT GGAAACTCAG ATGGATGCCT TTCTAGAAAA 3 50 

AGTAAAAGGT AAGTATATGA CCGCGACCTT GGTCAGTGCA AAGACCGGTG 4 00 

AAATCCTCGC TACCACCCAA CGACCTACCT TTAATGCAGA TACTAAAGAA 4 50 

GGAATCACTG AGGACTTTGT TTGGCGTGAT ATTCTTTATC AAAGTAACTA 500 

TGAACCAGGA TCAGCCATGA AGGTTATGAC GTTAGCTTCT TCTATTGATA 550 

55 ATAATACCTT CCCAAGTGGA GAATACTTCA ATAGCAGTGA ATTCAAAATA 600 

GCGGATGCGA CGACTCGAGA TTGGGATGTT AATGATGGTT TGACTACTGG 650 

TGGGATGATG ACTTTCTTAC AAGGTTTCGC TCACTCCAGT AATGTTGGAA 700 

TGAGTCTACT TGAACAAAAA ATGGGAGATG CTACTTGGTT GGATTATCTA 750 

AAACGCTTTA AATTTGGGGT TCCAACTCGC TTTGGCTTGA CAGATGAATA 800 

60 CGCTGGTCAA CTTCCAGCTG ATAATATTGT TAGTATTGCT CAAAGCTCAT 850 
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TTGGGCAAGG AATTTCAGTG ACACAAACAC AAATGCTTCG TGCCTTTACA 900 
GCTATTGCTA ATGATGGAGT TATGCTGGAG CCAAAATTTA TAAGTGCTAT 950 

TTATGATACT AACAATCAGT CTGTACGTAA GTCACAAAAA GAAATAGTAG 1000 

GAAATCCTGT TTCCAAAGAG GCAGCAAGCA CAACTCGAAA TCACATGATC 1050 

TTAGTTGGGA CGGACCCTCT ATATGGAACT ATGTATAATC ACTACACAGG 1100 

AAAGCCAATT ATAACAGTTC CTGGACAAAA TGTAGCAGTT AAATCCGGTA 1150 

CGGCTCAAAT CGCTGATGAG AAAAATGGAG GATACTTGGT TGGTTCTACC 12 00 

AATTATATTT TCTCAGTTGT GACTATGAAT CCTGCTGAAA ATCCTGATTT 12 50 

TATCTTGTAT GTAACGGTTC AACAGCCTGA GCATTATTCA GGTATCCAGT 13 00 

TGGGAGAATT TGCCACCCCA ATCTTGGAGC GGGCTTCAGC TATGAAAGAA 1350 

TCTCTCAATC TTCAATCTCC AGCCAAAAAT TTAGATAAAG TTACGACAGA 14 00 

ATCTTCTTAT GCAATGCCTA GCATCAAGGA TATTTCACCT GGTGAGTTGG 1450 

CGGAAGCCTT ACGCCGAAAT ATTGTGCAAC CAATCGTTGT AGGTACTGGA 1500 

ACAAAGATTA AAGAGACTTC TGTAGAAGAA GGGACCAATC TTGCACCAAA 1550 

CCAACAAGTT CTCCTTTTAT CGGATAAGGT AGAAGAAATT CCAGACATGT 1600 

ATGGCTGGAA AAAAGAGACT GCCGAGACCT TTGCTAAATG GTTGGATATT 1650 

GAACTGGAAT TTGAAGGTTC AGGTTCCGTT GTTCAGAAG 1689 



2) INFORMATION FOR SEQ ID NO: 1047 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1690 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-14 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1047 

CAAAAAAGAA TTGGAAGCTG CAGAGGTCAA GGGGATTGAT TTTACAACCA 50 

GTCCCAACCG TAGTTACCCA AACGGACAAT TTGCTTCTAG TTTTATCGGA 100 

CTAGCTCAGC TCCATGAAAA TGAAGATGGC AGCAAGAGCT TGCTGGGAAC 150 

TTCTGGAATG GAGAGTTCCT TGAACAGTAT TCTTGCAGGG ACAGACGGCA 2 00 

TTATTACCTA TGAAAAGGAT CGTCTGGGTA ATATTGTACC CGGAACAGAA 2 50 

CAAGTTTCCC AACAAACGGT AGATGGCAAG GATGTTTACA CAACCATTTC 3 00 

CAGCCCCCTC CAGTCCTTTA TGGAAACCCA GATGGATGCT TTTCAAGAGA 350 

AGGTAAAAGG AAAGTACACG ACAGCGACTT TGGTCAGTGC TAAAACAGGG 400 

GAAATTCTGG CAACAACGCA ACGACCGACC TTTGATGCAG ATACAAAAGA 450 

AGGCATTACA GAGGACTTTG TTTGGCGTGA TATCCTTTAC CAAAGTAACT 500 

ATGAGCCAGG TTCCCCTATG AAAGTGATGA TGTTGGCTGC TGCTATTGAT 550 

AATAATACCT TTCCAGGGGG AGAAGTCTTC AATAGTAGTG AGTTAAAAAT 600 

TGCAGATGTC ACGATTCGAG ATTGGGACGT CAATGAAGGA TTGACTGGTG 650 

GCAGAATGAT GACCTTTTCT CAAGGGTTCG CTCACTCAAG TAACGTTGGG 700 

ATGACGCTTC TTGAGCAAAA GATGGGAGAT GCTACATGGC TTGATTATCT 750 

AAATCGCTTT AAATTTGGTG TTCCGACTCG TTTTGGCTTG ACGGATGAAT 800 

ATGCAGGCCA ACTTCCAGCT GACAATATCG TAAATATTGC TCAGAGTTCA 850 

TTTGGACAAG GGATTTCAGC GACCCAGACG CAAATGATTC GTGCCTTCAC 900 

GGCTATTGCC AACGATGGAG TCATGTTAGA ACCTAAATTT ATCAGTGCCA 950 

TTTATGATCC AAATGATCAA ACTGCTCGGA AATCACAAAA AGAAGTTGTG 1000 

GGAAATCCTG TGTCTAAAGA TGCAGCGAGC TTGACGCGAA CGCATATGGT 1050 

TTTAGTCGGT ACCGATCCAG TATATGGAAC TATGTATAAT CATAAGACAG 1100 

GGAAACCAAC TGTAACTGTT CCTGGGCAAA ATGTAGCCCT CAAGTCTGGT 1150 

ACGGCTCAGA TTGCCGATGA GAAAAATGGA GGTTACTTAG TTGGTACGAC 12 00 
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CAATTACATT TTTTCGGCTG TATCGATGAA 
TTATTCTCTA TGTGACGGTT CAACAGCCTG 
TTGGGGGAAT TTGCCAATCC TATCTTGGAA 
TTCCCTTAAC CTCCAATCTA CCGCTAAAAC 
5 AAAGCGCTTA TGCCATGCCT AGCATCAAGG 
GCGGAAGCCT TACGTCGCAA TATTGTGCAA 
AACAAAAATT AAAGAATCAT CTGTAGAAGA 
ACCAGCAAGT TCTTCTCTTA TCTGATAAAG 
TATGGTTGGA CAAAAGAGAC TGCTGAGACC 
10 AGAACTTGAA TTTGAAGGTT CGGGCTCTAC 



PCT/CA00/01150 

CCCTGCTGAA AATCCTGATT 1250 

AGCATTATTC AGGTATTCAG 1300 

AGGGCAGTGG CTATGAAAGA 1350 

GTTAAATCAG GTAACCAATC 1400 

ACATTTCACC TGGCGATTTG 1450 

CCAATCGTTG TAGGAACAGG 1500 

AGGGACGGAT CTTGCACCTA 1550 

CAGAGGAAGT TCCAGATATG 1600 

TTTGCTAAGT GGCTCAATAT 1650 

TGTGCAGAAG 1690 



2) INFORMATION FOR SEQ ID NO: 1048 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1682 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1048 

30 AAAAGAGTTG GAAACTGCAG AGGTCAAGGG GATTGATTTT ACAACCAGTC 50 

CCAATCGTAG TTACCCAAAC GGACAATTTG CTTCTAGCTT TATTGGCTTA 100 

GCCCAACTTC ATGAAAATGA GGATGGTAGT AAGAGTTTGT TAGGGACTTC 150 

TGGTTTGGAG AGTTCTTTAA ATACCATTCT TGCTGGGACA GACGGTATTA 2 00 

TTACCTATGA AAAAGACCGT GTAGGAAATA TCGTACCAGG TACAGAACTG 250 

35 GTATCGCAAC AAACTGTGGA TGGCAAGGAT GTTTATACAA CATTGTCTAG 300 

TCCGCTACAA TCTTTCATGG AAACTCAGAT GGATGCCTTT CTAGAAAAAG 350 

TAAAAGGTAA GTATATGACC GCGACCTTGG TCAGTGCAAA GACCGGTGAA 400 

ATCCTCGCTA CCACCCAACG ACCTACCTTT AATGCAGATA CTAAAGAAGG 450 

AATCACTGAG GACTTTGTTT GGCGTGATAT TCTTTATCAA AGTAACTATG 500 

4 0 AACCAGGATC AGCCATGAAG GTTATGACGT TAGCTTCTTC TATTGATAAT 550 

AATACCTTCC CAAGTGGAGA ATACTTCAAT AGCAGTGAAT TCAAAATAGC 600 

GGATGCGACG ACTCGAGATT GGGATGTTAA TGAAGGTTTG ACTACTGGTG 650 

GGATGATGAC TTTCTTACAA GGTTTCGCTC ACTCCAGTAA TGTTGGAATG 700 

AGTCTACTTG AACAAAAAAT GGGAGATGCT ACTTGGTTGG ATTATCTAAA 750 

45 ACGCTTTAAA TTTGGGGTTC CAACTCGCTT TGGCTTGACA GATGAATACG 800 

CTGGTCAACT TCCAGCTGAT AATATTGTTA GTATTGCTCA AAGCTCATTT 850 

GGGCAAGGAA TTTCAGTGAC ACAAACACAA ATGCTTCGTG CCTTTACAGC 900 

TATTGCTAAT GATGGAGTTA TGCTGGAGCC AAAATTTATA AGTGCTATTT 950 

ATGATACTAA CAATCAGTCT GTACGTAAGT CACAAAAAGA AATAGTAGGA 1000 

50 AATCCTGTTT CCAAAGAGGC AGCAAGCACA ACTCGAAATC ACATGATCTT 1050 

AGTTGGGACG GACCCTCTAT ATGGAACTAT GTATAATCAC TACACAGGAA 1100 

AGCCAATTAT AACAGTTCCT GGACAAAATG TAGCAGTTAA ATCCGGTACG 1150 

GCTCAAATCG CTGATGAGAA AAATGGAGGA TACTTGGTTG GTTCTACCAA 1200 

TTATATTTTC TCAGTTGTGA CTATGAATCC TGCTGAAAAT CCTGATTTTA 1250 

55 TCTTGTATGT AACGGTTCAA CAGCCTGAGC ATTATTCAGG TATCCAGTTG 1300 

GGAGAATTTG CCACCCCAAT CTTGGAGCGG GCTTCAGCTA TGAAAGAATC 1350 

TCTCAATCTT CAATCTCCAG CCAAAAATTT AGATAAAGTT ACGACAGAAT 1400 

CTTCTTATGC AATGCCTAGC ATCAAGGATA TTTCACCTGG TGAGTTGGCG 1450 

GAAGCCTTAC GCCGAAATAT TGTGCAACCA ATCGTTGTAG GTACTGGAAC 1500 

60 AAAGATTAAA GAGACTTCTG TAGAAGAAGG GACCAATCTT GCACCAAACC 1550 
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AACAAGTTCT CCTTTTATCG GATAAGGTAG 
GGCTGGAAAA AAGAGACTGC TGAAACCTTT 
GTTGGAATTT GAAGGTTCAG GTTCCGTCGT 
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AAGAAATTCC AGACATGTAT 1600 
GCTAAATGGT TGGATATTGA 1650 
TC 1682 



2) INFORMATION FOR SEQ ID NO: 1049 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1241 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R690 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 104 9 

GCAGCGTTGC GTGATACCGT TGAAAAAACC ATTAAAAACT GTTTGGATTT 50 

TGAAAGGAGA CAGGAGCATG AATAGAATAA AAGTTGCAAT ACTGTTTGGG 100 

GGTTGCTCAG AGGAGCATGA CGTATCGGTA AAATCTGCAA TAGAGATAGC 150 

CGCTAACATT AATAAAGAAA AATACGAGCC GTTATACATT GGAATTACGA 200 

AATCTGGTGT ATGGAAAATG TGCGAAAAAC CTTGCGCGGA ATGGGAAAAC 250 

GACAATTGCT ATTCAGCTGT ACTCTCGCCG GATAAAAAAA TGCACGGATT 300 

ACTTGTTAAA AAGAACCATG AATATGAAAT CAACCATGTT GATGTAGCAT 350 

TTTCAGCTTT GCATGGCAAG TCAGGTGAAG ATGGATCCAT ACAAGGTCTG 400 

TTTGAATTGT CCGGTATCCC TTTTGTAGGC TGCGATATTC AAAGCTCAGC 450 

AATTTGTATG GACAAATCGT TGACATACAT CGTTGCGAAA AATGCTGGGA 500 

TAGCTACTCC CGCCTTTTGG GTTATTAATA AAGATGATAG GCCGGTGGCA 550 

GCTACGTTTA CCTATCCTGT TTTTGTTAAG CCGGCGCGTT CAGGCTCATC 600 

CTTCGGTGTG AAAAAAGTCA ATAGCGCGGA CGAATTGGAC TACGCAATTG 650 

AATCGGCAAG ACAATATGAC AGCAAAATCT TAATTGAGCA GGCTGTTTCG 700 

GGCTGTGAGG TCGGTTGTGC GGTATTGGGA AACAGTGCCG CGTTAGCTGT 750 

TGGCGAGGTG GACCAAATCA GGCTGCAGTA CGGAATCTTT CGTATTCATC 800 

AGGAAGTCGA GCCGGAAAAA GGCTCTGAAA ACGCAGTTAT AACCGTTCCC 850 

GCAGACCTTT CAGCAGAGGA GCGAGGACGG ATACAGGAAA CGGCAAAAAA 900 

AATATATAAA GCGCTCGGCT GTAGAGGTCT AGCCCGTGTG GATATGTTTT 950 

TACAAGATAA CGGCCGCATT GTACTGAACG AAGTCAATAC TCTGCCCGGT 1000 

TTCACGTCAT ACAGTCGTTA TCCCCGTATG ATGGCCGCTG CAGGTATTGC 1050 

ACTTCCCGAA CTGATTGACC GCTTGATCGT ATTAGCGTTA AAGGGGTGAT 1100 

AAGCATGGAA ATAGGATTTA CTTTTTTAGA TGAAATAGTA CACGGTGTTC 1150 

GTTGGGACGC TAAATATGCC ACTTGGGATA ATTTCACCGG AAAACCGGTT 12 00 

GACGGTTATG AAGTAAATCG CATTGTAGGG ACATACGAAT T 1241 



2 ) INFORMATION FOR SEQ ID NO: 1050 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1249 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



593 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R691 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1050 

TCACACCGCA TACGGCCTAT TATACCGAGC AAGCGTTGCG TGATACCGTT 50 

GAAAAAACCA TTAAAAACTG TTTGGATTTT GAAAGGAGAC AGGAGCATGA 100 

ATAGAATAAA AGTTGCAATA CTGTTTGGGG GTTGCTCAGA GGAGCATGAC 150 

GTATCGGTAA AATCTGCAAT AGAGATAGCC GCTAACATTA ATAAAGAAAA 200 

ATACGAGCCG TTATACATTG GAATTACGAA ATCTGGTGTA TGGAAAATGT 250 

GCGAAAAACC TTGCGCGGAA TGGGAAAACG ACAATTGCTA TTCAGCTGTA 3 00 

CTCTCGCCGG ATAAAAAAAT GCACGGATTA CTTGTTAAAA AGAACCATGA 3 50 

ATATGAAATC AACCATGTTG ATGTAGCATT TTCAGCTTTG CATGGCAAGT 400 

CAGGTGAAGA TGGATCCATA CAAGGTCTGT TTGAATTGTC CGGTATCCCT 450 

TTTGTAGGCT GCGATATTCA AAGCTCAGCA ATTTGTATGG ACAAATCGTT 500 

GACATACATC GTTGCGAAAA ATGCTGGGAT AGCTACTCCC GCCTTTTGGG 550 

TTATTAATAA AGATGATAGG CCGGTGGCAG CTACGTTTAC CTATCCTGTT 600 

TTTGTTAAGC CGGCGCGTTC AGGCTCATCC TTCGGTGTGA AAAAAGTCAA 650 

TAGCGCGGAC GAATTGGACT ACGCAATTGA ATCGGCAAGA CAATATGACA 700 

GCAAAATCTT AATTGAGCAG GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG 750 

GTATTGGGAA ACAGTGCCGC GTTAGCTGTT GGCGAGGTGG ACCAAATCAG 800 

GCTGCAGTAC GGAATCTTTC GTATTCATCA GGAAGTCGAG CCGGAAAAAG 850 

GCTCTGAAAA CGCAGTTATA ACCGTTCCCG CAGACCTTTC AGCAGAGGAG 900 

CGAGGACGGA TACAGGAAAC GGCAAAAAAA ATATATAAAG CGCTCGGCTG 950 

TAGAGGTCTA GCCCGTGTGG ATATGTTTTT ACAAGATAAC GGCCGCATTG 1000 

TACTGAACGA AGTCAATACT CTGCCCGGTT TCACGTCATA CAGTCGTTAT 1050 

CCCCGTATGA TGGCCGCTGC AGGTATTGCA CTTCCCGAAC TGATTGACCG 1100 

CTTGATCGTA TTAGCGTTAA AGGGGTGATA AGCATGGAAA TAGGATTTAC 1150 

TTTTTTAGAT GAAATAGTAC ACGGTGTTCG TTGGGACGCT AAATATGCCA 12 00 

CTTGGGATAA TTTCACCGGA AAACCGGTTG ACGGTTATGA AGTAAATCG 1249 



2) INFORMATION FOR SEQ ID NO: 1051 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1272 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R4 81 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1051 

TATACCGAGC AAGCGTTGCG TGATACCGTT GAAAAAACCA TTAAAAACTG 50 

TTTGGATTTT GAAAGGAGAC AGGAGCATGA ATAGAATAAA AGTTGCAATA 100 

CTGTTTGGGG GTTGCTCAGA GGAGCATGAC GTATCGGTAA AATCTGCAAT 150 

AGAGATAGCC GCTAACATTA ATAAAGAAAA ATACGAGCCG TTATACATTG 200 

GAATTACGAA ATCTGGTGTA TGGAAAATGT GCGAAAAACC TTGCGCGGAA 2 50 

TGGGAAAACG ACAATTGCTA TTCAGCTGTA CTCTCGCCGG ATAAAAAAAT 3 00 

GCACGGATTA CTTGTTAAAA AGAACCATGA ATATGAAATC AACCATGTTG 3 50 

ATGTAGCATT TTCAGCTTTG CATGGCAAGT CAGGTGAAGA TGGATCCATA 4 00 

CAAGGTCTGT TTGAATTGTC CGGTATCCCT TTTGTAGGCT GCGATATTCA 450 

AAGCTCAGCA ATTTGTATGG ACAAATCGTT GACATACATC GTTGCGAAAA 500 
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ATGCTGGGAT AGCTACTCCC GCCTTTTGGG 
CCGGTGGCAG CTACGTTTAC CTATCCTGTT 
AGGCTCATCC TTCGGTGTGA AAAAAGTCAA 
ACGCAATTGA ATCGGCAAGA CAATATGACA 
GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG 
GTTAGTTGTT GGCGAGGTGG ACCAAATCAG 
GTATTCATCA GGAAGTCGAG CCGGAAAAAG 
ACCGTTCCCG CAGACCTTTC AGCAGAGGAG 
GGCAAAAAAA ATATATAAAG CGCTCGGCTG 
ATATGTTTTT ACAAGATAAC GGCCGCATTG 
CTGCCCGGTT TCACGTCATA CAGTCGTTAT 
AGGTATTGCA CTTCCCGAAC TGATTGACCG 
AGGGGTGATA AGCATGGAAA TAGGATTTAC 
ACGGTGTTCG TTGGGACGCT AAATATGCCA 
AAACCGGTTG ACGGTTATGA AGTAAATCGC 
GGCTTGAATC GCTTTTTGAA GG 
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TTATTAATAA AGATGATAGG 550 

TTTGTTAAGC CGGCGCGTTC 600 

TAGCGCGGAC GAATTGGACT 650 

GCAAAATCTT AATTGAGCAG 700 

GTATTGGGAA ACAGTGCCGC 750 

GCTGCAGTAC GGAATCTTTC 800 

GCTCTGAAAA CGCAGTTATA 850 

CGAGGACGGA TACAGGAAAC 900 

TAGAGGTCTA GCCCGTGTGG 950 

TACTGAACGA AGTCAATACT 1000 

CCCCGTATGA TGGCCGCTGC 1050 

CTTGATCGTA TTAGCGTTAA 1100 

TTTTTTAGAT GAAATAGTAC 1150 

CTTGGGATAA TTTCACCGGA 1200 

ATTGTAGGGA CATACGAATT 1250 

1272 



2) INFORMATION FOR SEQ ID NO: 1052 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1237 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R492 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1052 

TCCCCCGGCA TACGGCCTAT TATACCGAGC AAGCGTTGCG TGATACCGTT 50 

GAAAAAACCA TTAAAAACTG TTTGGATTTT GAAAGGAGAC AGGAGCATGA 100 

ATAGAATAAA AGTTGCAATA CTGTTTGGGG GTTGCTCAGA GGAGCATGAC 150 

GTATCGGTAA AATCTGCAAT AGAGATAGCC GCTAACATTA ATAAAGAAAA 2 00 

ATACGAGCCG TTATACATTG GAATTACGAA ATCTGGTGTA TGGAAAATGT 250 

GCGAAAAACC TTGCGCGGAA TGGGAAAACG ACAATTGCTA TTCAGCTGTA 3 00 

CTCTCGCCGG ATAAAAAAAT GCACGGATTA CTTGTTAAAA AGAACCATGA 350 

ATATGAAATC AACCATGTTG ATGTAGCATT TTCAGCTTTG CATGGCAAGT 400 

CAGGTGAAGA TGGATCCATA CAAGGTCTGT TTGAATTGTC CGGTATCCCT 450 

TTTGTAGGCT GCGATATTCA AAGCTCAGCA ATTTGTATGG ACAAATCGTT 500 

GACATACATC GTTGCGAAAA ATGCTGGGAT AGCTACTCCC GCCTTTTGGG 550 

TTATTAATAA AGATGATAGG CCGGTGGCAG CTACGTTTAC CTATCCTGTT 600 

TTTGTTAAGC CGGCGCGTTC AGGCTCATCC TTCGGTGTGA AAAAAGTCAA 650 

TAGCGCGGAC GAATTGGACT ACGCAATTGA ATCGGCAAGA CAATATGACA 700 

GCAAAATCTT AATTGAGCAG GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG 750 

GTATTGGGAA ACAGTGCCGC GTTAGCTGTT GGCGAGGTGG ACCAAATCAG 800 

GCTGCAGTAC GGAATCTTTC GTATTCATCA GGAAGTCGAG CCGGAAAAAG 850 

GCTCTGAAAA CGCAGTTATA ACCGTTCCCG CAGACCTTTC AGCAGAGGAG 900 

CGAGGACGGA TACAGGAAAC GGCAAAAAAA ATATATAAAG CGCTCGGCTG 950 

TAGAGGTCTA GCCCGTGTGG ATATGTTTTT ACAAGATAAC GGCCGCATTG 1000 

TACTGAACGA AGTCAATACT CTGCCCGGTT TCACGTCATA CAGTCGTTAT 1050 

CCCCGTATGA TGGCCGCTGC AGGTATTGCA CTTCCCGAAC TGATTGACCG 1100 

CTTGATCGTA TTAGCGTTAA AGGGGTGATA AGCATGGAAA TAGGATTTAC 1150 

TTTTTTAGAT GAAATAGTAC ACGGTGTTCG TTGGGACGCT AAATATGCCA 1200 

CTTGGGATAA TTTCACCGGA AAACCGGTTG ACGGTTA 1237 
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